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PREFACE 


Durinc the second World War, scientific methods of personnel 
selection were applied in the Royal Navy, the Army and A.T.S., 
and the Royal Air Force on a scale quite unprecedented in 
Britain. This development has naturally aroused widespread inter- 
est, and a desire for further information beyond that provided by 
articles in the popular Press or in technical psychological journals. 
The present book attempts, therefore, to give an account of the pro- 
cedures used and the results obtained. It is addressed both to the 
industrialist or educationist who hopes that the methods may be of 
value in the peace-time selection of employees, students or pupils, 
and to the student of psychology or to the personnel officer who 
wishes to know what psychological discoveries were made and 
what techniques were found most valuable. If for this reason it 
falls/between two stools, we must apologise; it is not easy to write 
a simple, readable and interesting description for the former and 
at the same time to give the latter all the technical details that he 
needs, Statistical terms, formulz and calculations have been kept 
to the barest minimum and will, it is hoped, be published elsewhere. 

Our primary aim has been to describe such novel contributions 
as were made in the Fighting Services. But although space has not 
allowed a comprehensive survey of peace-time selection and guid- 
ance, we have tried to view our methods and results in this broader 
setting, and to integrate them with recent developments in civilian 
vocational and educational psychology. 

After an historical introduction, Part I is concerned with the 
organisation of selection, the general procedures employed, and the 
work of psychologists in the Royal NavygeApmy and A.T.S., and 
the R.A.F. Part II takes up the principles of selection and guidance 
which have evolved both from pre-war investigations and from, 
war-time experience. It reviews the techniques of interviewing and 
testing, together with evidence of their merits and defects, in some 
detail. Each chapter is preceded by a general summary or abstract, 
and the main conclusions relevant to peace-time work are brought 
together in the final chapter. 

It should be made clear that two important topics are almost 
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wholly neglected. First, we have not been able to cover the work 
on selection problems by psychologists outside the British Fighting 
Services during 1939-47. Much was done, for example, by mem- 
bers of the Cambridge University Psychology Department, Uni- 
versity College (London), and by other teams and individuals. 
Again, to attempt to describe the even more extensive develop- 
ments of personnel selection in the American Forces would shave 
been quite impossible at the time of writing. Secondly, we have not 
tried to deal with military applications of psychology apart from 
selection—for example, studies of training methods, of the design 
and layout of equipment, of the effects of conditions of warfare on 
morale, and so forth—whether conducted by psychologists inside 
or outside the Services. It may safely be stated that at least as 
valuable contributions to the efficiency of the Forces can be, and 
have been, made by psychologists in such fields as by those con- 


cerned with selection and guidance. The same, of course, is true in 
industry and education, 
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first the naval, milita: 
secondly, the numb 


two appendices which list, 
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ers or abbreviated titles of the main psycho- 
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articles, which are listed in the bibliography at the end of the book. 

We wish to express our thanks to Mr. Alec Rodger, Col. B. 
т. В. S. Morris for reading 
and for their suggestions and 


il, but the responsibility for 


any statements of fact or opinions rests solely with the authors. 


п 


CONTENTS 


PREFACE . А B 3 s 5 
Part I 
THE RISE OF VOCATIONAL PSYCHOLOGY . + мт 
PERSONNEL SELECTION IN THE ROYAL NAVY . M 25 
OTHER RANK SELECTION IN THE ARMY AND A.T.S. . 39 
-ARMY OFFICER SELECTION + А s 52 
SELECTION IN THE ROYAL AIR FORCE . һ 67. 
Part П 
GENERAL PRINCIPLES OF VOCATIONAL CLASSIFICATION . 85 
THE VALUE OF VOCATIONAL CLASSIFICATION PROCEDURES 103 
THE BIOGRAPHICAL QUESTIONNAIRE i S105 72101 
THE INTERVIEW AND ITS VALIDITY c E л 143 
PRINCIPLES OF PSYCHOLOGICAL TESTING . é 065 
EFFECTS OF PRACTICE, AGE AND OTHER FACTORS ON 

TEST SCORES . Ё A : 4 4 . 181 
GENERAL SURVEY OF THE VALUE OF VOCATIONAL TESTS . 204 
VERBAL INTELLIGENCE AND EDUCATIONAL TESTS 7118220 
NON-VERBAL AND MECHANICAL TESTS  . d 1234 
SPECIAL APTITUDE, AND TEMPERAMENT, TESTS . 247 
MAIN R.A.F. SELECTION FINDINGS . 2 ў . 261 
CONCLUSIONS . : Б : 2 д и 281 

Appendices i \ 
ABBREVIATIONS EMPLOYED IN THE TEXT OR IN APPEN- 

DIX и. à А ` E К ^ - 293 
THE МАІМ! PSYCHOLOGICAL TESTS USED IN THE FORCES 295 
BIBLIOGRAPHY . H 3 i : 3 + 302 
INDEX . . : : д 3 5 А © 313 

7 


m 
Un 


* PERCENTAGE OF WASTAGE RE 


+ PREDICTIVE VALUE OF A CLERICAL APTITUDE TEST 
. NINETIETH, FIFTIETH AND TENTH PERCENTILE T2 SCORES 


. SQUARES TEST, 


* PERCENTAGE OF PILOT 


LIST OF DIAGRAMS 


CHART INDICATING THE MAIN ORIGINS AND THE COURSE 
OF DEVELOPMENT OF VOCATIONAL PSYCHOLOGY 


+ SCATTERGRAMS OF CORRELATION COEFFICIENTS WITH 


SELECTION RATIOS OF 50 PER CENT. AND 20 PER CENT. 


MAINING WITH DIFFERENT 
SIZES OF CORRELATION . 


OF MEN EMPLOYED IN р 


IFFERENT BRANCHES OF THE 
NAVY. 


FIGURE CONSTRUCTION TEST 
PAPER FORMBOARD CEST . 


MECHANICAL COMPREHENSION TEST ITEMS 


TRAINING WASTAGE BEFORE GRAD- 
ING E А 


MARES; APRIL, 1942, FIRST ТОО 
EACH SCHOOL . 


SECOND FLIGHT TEST 
CADETS GRADED АТ 


13 
129 


130 
179 


211 
235 
237 
237 
238 
241 


263 
264 
264 
265 
270 


271 


272 


PART I 


The organisation of selection, the 
general procedures employed and 
the work of psychologists in the 
Royal Navy, Army and A.T.S., 
and the Royal Air Force. 


CHAPTER I 
THE RISE OF VOCATIONAL PSYCHOLOGY 


Abstract.—Psychology and the other social sciences can help in the 
solution of many problems of the modern world, both in war and 
peace. The scientific approach is particularly valuable whenever 
prediction of human behaviour enters, as in educational and voca- 
tional selection. The chart on page 13 indicates the main origins 
and the course of development of vocational psychology, which 
are described in this chapter. 


It is a truism that the solution of the problems of individuals and 
of societies in the modern world depends more on progress in the 
social sciences, including psychology, sociology and economics, 
than on further advances in the physical and biological sciences, 
engineering and medicine. The importance of psychology becomes 
particularly obvious during periods of emergency such as the two 
World Wars, since the need for making the most effective possible 
use of human as well as of mechanical resources is then realised. 
It is the object of this book to describe the developments that 
occurred in war-time in applying psychology to the classification of 
British manpower, and especially of mind or brain power.* 
Scarcely less urgent, however, are the problems of adjusting the 
individual in peace-time to the educational and industrial systems 
of the society in which he lives, and of adjusting these systems to 
the legitimate demands of individuals. Hence we shall attempt to 
show how far experience gained in the guidance and selection of 
recruits may be carried over into civilian practice. 

Now the social sciences are at the same time the most immature 
and the most intractable of all the sciences precisely because they 
are concerned with man. Individuals differ too widely in their 
tastes, talents and temperaments to be readily amenable to scientific 
analysis and control. As each one is usually convinced that he knows 

* This conception of “human engineering” is borrowed from Yoakum and 


Yerkes’ book on mental tests in the American Army (1920), and Yerkes’ memo- 
randum to the U.S. Secretary of War (1941). 
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what is best for himself and his neighbours, he resents being 
studied, classified and regulated by psychologists and economists. 
He fears that this involves forfeiting his free choice and being 
directed in all his doings from birth to the grave by an army of 
scientific experts—a prospect even more abhorrent than his present 
subjection to teachers, employers and civil servants. This suspicion 
however, is hardly warranted, for several reasons. 

First, the psychologist admits, more readily indeed than does the 
opinionated layman, that he is not omniscient about human nature. 
He prefers not to pronounce definite decisions except in those rare 
instances where he has strong scientific proof of their validity. In 
fact, his caution is often apt to seem rather irritating. Secondly, 
the good psychologist recognises and allows as much scope as 
possible for individuality. He never willingly dictates a course of 
action, but attempts rather to clarify the situation confronting the 
individual, and to draw attention to all the relevant factors, so that 
the individual may himself arrive at a wise and satisfying solution. 
(His method of tackling problems within a group or organisation 
such as an industrial firm, or the army, is similar. It certainly does 
not approximate to psychological totalitarianism.)* Thirdly, it 
must be admitted that man’s freedom is already greatly circum- 
scribed. Decisions are made over his head by his parents, teachers 
and employers, usually on much more arbitrary and fallible 
grounds than those which the psychologist, with his specialised 
training and ingenious techniques, takes into account. Expressed 
in another way, the life-history of an individual is punctuated by 
a series of predictions. When his mother smacks him, she predicts 
that this will make him behave better in future; his success or 
failure in the Special Place examination predicts his suitability or 
unsuitability for secondary education; similarly, his acceptance or 
rejection for a job, or in marriage, are essentially prophecies. 
Psychological investigations have proved that many of these pre- 
dictions are incorrect, and that they are potent sources of unhappi- 
ness, delinquency and neurosis. Predictions by psychologists 
themselves are certainly not perfect, but they are constantly being 
checked and improved. In many fields, such as the upbringing of 
children, in educational and vocational guidance and selection, 
their superiority to the haphazard and unsystematic predictions 


* cf. the articles on psychotherapy, i i : А Fee, 
= ру, industrial tions 
by members of the Tavistock Clinic, Y ques 24 ad oo ag 
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of everyday life has been demonstrated. In others, such as 
international relations and economic policies, psychologists 
are convinced that scientific investigations would help, but are 
seldom so rash as to claim that their knowledge is sufficiently far 
advanced to provide immediate remedies for the country's or the 
world's ills. 


Historical Outline 


Only a bird's eye view will be attempted here of the development 
of the branch of applied psychology with which we are concerned. 
Viteles (1932) provides an excellent history of industrial psycho- 
logy; a Board of Education Report (1924) describes the rise of 
mental testing, and Flugel (1933) outlines the wider background. 
Until halfway through the nineteenth century, psychology was an 
entirely theoretical subject—a branch of philosophy—and the 
foundation of the first laboratory at Leipzig by Wundt in 1879 
epitomised its conversion into an experimental science of human 
thought and behaviour. Such work as that of Helmholtz on vision, 
hearing and reaction time (quickness of response), and Galton's 
fertile explorations of individual differences in intellect and mental 
1magery (Inquiries Into Human Faculty, 1883), contributed to this 
new outlook. Among Wundt’s many pupils were the Americans, 
1. McK. Cattell, who in the 1890’s investigated the wide range of 
differences on simple tests of sensory and motor capacities, and 
Stanley Hall, who fostered some of the first systematic studies of 
childhood. Contemporaneously the nineteenth century witnessed 
a great increase in humanitarian attitudes towards the insane and 
mentally defective, and in education and in industry, and these 
stimulated the demand for better methods of diagnosing and ` 
predicting human qualities and capacities, 


Education 


qualifying procedures at about this time. While 
marked a great advance over oral tests by 
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inspectors in schools and “disputations” in the universities, later 
work has revealed their defectiveness as selection techniques 
owing to unreliability and subjective standards of marking (cf. 
Vernon, 1940). The first tests of intelligence, namely Ebbinghaus’s 
Completion Test and Binet’s famous scale, were constructed at the 
request of the education authorities of Breslau and Paris respec- 
tively at the turn of the century. These tests and their numerous 
progeny made more accurate the segregation of mentally defective 
children for special education, also the discrimination between 
backward children who are innately dull and those whose retarda- 
tion is due to physical, emotional or other causes. Tests of attain- 
ment in spelling, reading and arithmetic, and scales for grading 
hand-writing and composition, were constructed by Thorndike, 
Courtis and others in America around 1910. These were stan- 
dardised so that a child’s performance could be compared with the 
average for his age, and accurate surveys could be made of the 
relative achievement of classes or schools. Thorndike also estab- 
lished the pattern for scientific studies of teaching methods. The 
superiority or inferiority of a new device over stock methods could 
be demonstrated by the progress made among comparable groups 
of children, instead of by personal impression and arguments based 
on tradition. In 1913 Burt was appointed by the London County 
Council as first psychologist to an education authority and initiated 
his series of researches into intelligence and attainment tests, into 
delinquency and backwardness, and into statistical methods of 
analysing psychological data, which formed the basis of so much 
subsequent British work in applied psychology. 

We must here retrace our steps and mention the important 
contributions to the conception of mental measurement of math- 
ematicians such as Quetelet, who demonstrated the applicability 
of the normal or Gaussian frequency curve to human character- 
istics, and Galton and Pearson whose technique of correlation 
made it possible to measure the extent of agreement between scores. 
on two or more tests or examinations (cf. p. 104). From this 
evolved factor analysis, which is used for classifying the main 
types of human abilities. In early writings on psychology and 
education frequent reference is made to faculties or mental powers 
such as attention, memory, observation, etc. Spearman's researches 
from 1904 onwards largely discredited these and demonstrated the 
importance of “g”, the general intellectual factor, underlying all 
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our abilities. Burt and Thomson in England and Thurstone and 
others in America greatly extended factorial techniques and 
demonstrated the existence of additional factors or types of ability 
—verbal, numerical, mechanical, spatial, etc. 

Educational psychology was profoundly influenced again by 
nineteenth-century developments in evolutionary biology and in 
psychopathology. ^ McDougall's writings on the instincts and 
sentiments and Freud's studies of the unconscious motives under- 
lying human nature, greatly increased our understanding of the 
difficult or nervous child, the delinquent and the neurotic. The 
first clinic for children was established by Witmer at the University 
of Pennsylvania in 1896, but this was mainly concerned with 
treating scholastic difficulties, In 1909 Healy commenced his 
studies of behaviour problem cases and delinquents in Chicago. 
Burt was the chief originator of similar work in England, followed 
by Boyd and Drever in Scotland. 


Industry 

, While many early psychological studies of fatigue and of con- 
ditions affecting learning were relevant to industrial work, the 
first important experiments in industry itself were those of Taylor 
and Gilbreth, in America, around 1910. The stress laid by these 
experts in "scientific management" and their followers on the 
Increased output resulting from more efficient working methods 
tended to antagonise workers and the Trade Unions. Although 
industrial psychologists have been at least as interested in the 


Co-operation in the Second World 
Labour and psychologists in the For 
Series of researches on conditions in British factories was begun 
by H. M. Vernon, under the Health of Munition Workers Com- 
mittee, in the first war, ang continued by the Industrial Fatigue 
Research Board (later the Industrial Health Research Board)—a 


War between the Ministry of 
ces. Nevertheless an important 
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usually attributed to Muensterberg, who worked in Germany till 
1911 and then in America. He outlined the functions of psychology 
in contributing to industrial efficiency and in serving the interests 
both of employees and employers, and, himself developed tests 
for selecting tramcar drivers, telephone operators and others. 
Gradually the view made headway that the choice of the right man 
is no less important than the choice of the right machine. It was 
estimated that, at a typical job, the most efficient workers are fully 
three times as competent as the least efficient. 


1914-1918 


This brings us to the first World War, when psychologists in 
Germany, Britain, France, Italy and America were engaged on the 
development of tests for night vision, for the selection of pilots and 
observers, drivers and telegraphists, for the anti-submarine service, 
etc., as well as on other contributions to military psychology which 
Burt (1942) has described. Few of these tests have survived, and 
in most countries psychologists were too few in number to under- 
take large-scale schemes or to have much influence on the service 
authorities, But an outstanding landmark in the history of voca- 
tional psychology was the application, during 1917-18, of group 
intelligence tests (devised by Otis and other American psycho- 
logists) to nearly two million American army recruits (cf. Yoakum 
and Yerkes, 1920, Memoirs of the National Academy of 
Sciences, 1921). Up to 2,000 a day were tested in any one centre, 
in batches of one to five hundred at a time. A battery of verbal tests 
—Army Alpha—was used with the majority, but a non-verbal 
series—Army Beta—was given to the 30 per cent. who could 
barely read English, and adaptations of the Binet scale, or per- 
formance tests, to borderline defectives. The value of the test 
scores was clearly demonstrated, and they could be used for : 

(1) Selecting men of superior ability for officer or N.C.O. 

training, or other special duties. T 1 

(2) Selecting duller men for labour or special training units, 

and for eliminating the lowest grades. j x 

(3) Balancing the distribution of ability between different units. 

(4) Allocating men into groups which would be homogeneous 1n 

ability and thus helping instruction. : 

Though these applications were permissive, not obligatory, 
psychologists had less trouble in persuading the Army to adopt 
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them than in dissuading it from attaching over-much importance 

to test scores and neglecting physical, personality, and other 

relevant factors. Owing to lack of time relatively little attempt was 

made to test aptitudes for specialised training, but extensive work 

. Was carried out on trade tests for assessing the proficiency of 
recruits from skilled and semi-skilled civilian trades. 


-type scholastic tests, 
with similar make-up and items to those of intelligence tests, 


foundation by M 
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Psychology in 1921. The N.I.LP. is an independent organisation 
whose chief activities include investigations for particular firms 
into their problems of selection, training and working conditions, 
and the giving of vocational guidance to individual applicants— 
mostly boys and girls leaving school—who desire advice on suit- 
able careers. So far as its limited funds would permit, it too has 
conducted researches, for example into tests of mechanical apti- 
tude, of motor-driving and other abilities, and into practicable 
techniques of guidance for school-leavers in large areas. While 
certain firms such as Rowntrees (cf. Northcott, 1945) have 
employed psychologists in their personnel departments, the 
majority of industrial and vocational psychologists in this country 
have been trained by the І.Н.В.В. and N.LI.P., and a great deal of 
the work of these bodies between the wars was directly relevant to 
the problems with which British psychologists were called upon 
to cope in 1941. Selection tests were applied to apprentices in 
the Army and the Royal Air Force, for example by Cox (1928), 
Stanbridge (1936), and Farmer and Chambers (1936). The writings 
of Burt et al. (1926) and Macrae (1932) have been particularly 
influential in the field of guidance. Р 
The first attempts to give vocational guidance to childrenare said 
to have been those of Parsons in America in about 1900, but many 
school teachers have long accepted the responsibility of advising 
their leaving pupils unofficially regarding suitable employment in 
the light of their knowledge of the pupils’ abilities and characters. _ 
Juvenile Employment Committees, working either under the 
Ministry of Labour or under Local Education Authorities, are 
nowadays concerned with vocational placement, and do their best 
to integrate the views of the school, the child’s and parents’ 
interests, and local labour requirements. But much evidence has 
accumulated as to the untrustworthiness of these haphazard pro- 
cedures and the superiority of the more scientific guidance methods 
worked out by the N.I.I.P. (cf. Chapter VII). Mention should be 
made also of the Appointments Boards or Committees for placing 
university students in jobs, the first of which was established at 
Oxford University in 1899. A growing number of secondary 
schools have part-time careers masters or mistresses, given intro- 
ductory training at the N.I.LP., who, in the quite inadequate time 
usually allotted, try to survey the pupils systematically and to give 
them guidance on leaving. Such activities are much better developed 
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in American high schools and colleges, where they are known as 
“Student Counselling". "They include advice to students on their 
Work and educational careers, financial, health and family diffi- 
culties, and personality maladjustments as well as vocational plans 
(cf. Williamson, Darley, et al., 1937, 1941). The vast “Advisement 
and Guidance” programme of the United States V. eterans' Admin- 
istration is on similar lines (cf. Scott and Lindley, 1946). Two to 
three thousand psychologists are engaged at some 400 centres in 
studying the potentialities and needs of ex-Service men and help- 
ing them to readjust to civilian life and to reach sound decisions 


Educational and Vocational guidance have often been given, in 
this country, by Child Guidance Clinics, During the 1920's 
Clinics spread rapidly, both in America and Britain, with the 
Support of the Commonwealth Fund. Their staff usually comprised 
à psychiatrist in charge of diagnosis and treatment, a social worker 
to explore the home background, and a psychologist to survey the 
E: _to carry out remedial educational work. 
ay and speech therapists were also often attached. This system 
ust as vocational guid- 
hologists and teachers, 


inics. When psychological 
méntal hospitals, it was all 
1 by the psychiatrist with a 
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particular mental functions in which deterioration may have 
occurred, and to analyse the general personality structure— 
provided he has received thorough theoretical, practical, and interne 
training. Such training is difficult to come by in this country as 
yet, but in America several hundred clinical psychologists were 
employed in the army during the war, and at least as many are now 
working for the Veterans’ Administration on diagnosis, psycho- 
therapy and research with neuropsychiatric cases, їп addition to 
those in civilian posts (cf. Miller, 1946; Hutt and Milton, 
1947). 

A vast amount of research, which we cannot attempt to sum- 
marise here, was published between the wars on topics of the 
utmost concern to vocational psychologists, e.g. the relative 
influence of heredity and environmental conditions on intelligence 
and other abilities; differences between the sexes, national and 
occupational groups, and age changes; new tests and diagnostic 
techniques, particularly in the field of personality; statistical 
methods such as applications of analysis of variance to psycho- 
logical data; studies of the structure, and relations between, 
abilities and traits; analyses of jobs and of working methods. One 
matter, however, which we must not neglect is the development of 
Vocational psychology in Germany. 


German Military Psychology 

The lead given by Muensterberg was taken up, aS Viteles (1926- 
30) has shown, more enthusiastically by German than by British 
and American industry. Numerous organisations, including 
Krupps, Zeiss and the Berlin Tramways had psychological depart- 
ments largely engaged in devising and applying selection tests to 
applicants for employment. Moreover, while military psychology 
lapsed completely in English-speaking countries during the 1920's 
and 30’s, Germany built on what had been achieved and actively 
developed it in the reconstitution of her Forces from 1927 onwards. 
It has been claimed, probably with justification, that the speed with 
which the Luftwaffe was re-created was largely due to the excellent 
quality of the human material selected for it. As Farrago (1941), 
Ansbacher (1941) and—more briefly—Burt (1942) describe, each 
Army Corps had its psychological section, staffed by carefully 
chosen personnel who had undergone military as well as psycho- 


logical training. Military psychology claimed to embrace personnel 
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based on the completion of simple drawings. Other, military, 
tests involved the drilling of recruits and instructing them in 
some task or lecturing to them, also carrying out complex 
orders requiring quickness of uptake, agility and endurance, 
and improvisation in emergencies. Candidates were encour- 
aged or severely criticised, in order that the effects of these 
social stimuli might be observed. 

(3) Expression analysis. Facial expression was observed, and 
often photographed by a concealed cine camera, during con- 
versation or tests involving distractions or painful stimuli. 
Speech, handwriting and literary style were also analysed in 
detail, and the significance of each mode of expression for 
the total personality considered. 

(4) Life-history. The candidate's party record was studied, and 
data were collected by questionnaire and interview on his 
past development, his aims and ambitions, his self-evaluation 
and social attitudes. At the close there was 2 discussion of 
some topic by all the candidates, designed to show up their 
competitiveness and other social reactions. Detailed reports 
were drawn up for the consideration of the superior officer. 

Although the selectees, both officers and specialists, were 

followed up at intervals and their progress compared with the 
Psychologists’ judgments, this comparison was of a subjective kind. 
Excellent or good agreement was alleged in 80-90 per cent. of 
cases, but there was no scientific validation either of the selection 
Procedure as a whole, or of the deductions made from the separate 
tests. In other words the claim that such and such a test revealed 
will-power, imagination, or other vague faculties was entirely а 
Priori, and in the absence of empirical substantiation, German 
Psychologists had little defence when they fell into disfavour. 
Actually their prestige was already declining in 1939 and their work 
. Was entirely stopped in 1941. It is generally believed that this was 
due to political prejudice, but Davis (1947) attributes it rather to 
the inherent weaknesses of their approach. ы 


„Conclusion \ ee. 
With the opening of the second World War, British psycho- 


logists were eager to put their training and experience to useful 


Servi. A 3 hology among naval and 
ce. But the prestige of vocational psycho MEE much lower 


military authorities and in Government circles 


CHAPTER II 
PERSONNEL SELECTION IN THE ROYAL NAVY 


Abstract.—Selection in the Royal Navy was somewhat unscientific 
and unsystematic up till 1941, when the Senior Psychologist’s 
Department of the Admiralty was established. The primary object 
of the department was to assist in selection of recruits at combined 
recruiting centres, and in their allocation to suitable branches at 
entry establishments. The staff of the department, their training, 
and the tests and other techniques employed, are described. Most 
of the qualified personnel were civilians, but interviewing, alloca- 
tion, etc., were largely carried out by Wren petty officers or 
Personnel selection officers. Important features were the provision 
of information on jobs to recruits, the matching of supply with the 
demands of all the branches, and re-allocation of failures. Personnel 
Work was later developed in depots, barracks and specialist schools, 
among Fleet Air Arm mechanics, Marines and artificers; and a 
Scheme of selection was integrated with the training of officer 
candidates, Follow-up or validation investigations, other research 
activities, studies of training, documentation, “work-simplifica- 
tion” and opinion surveys, are outlined. 
woe tenm 


Up till 1941 such selection as existed in the Royal Navy was 
based mainly on èducational examinations and on interview. 
Scholastic performance played a large part in the admission of 
cadets and of artificer apprentices, that is of future officers and 
skilled tradesmen, also a smaller part in the acceptance of con- 
tinuous service and special service ratings. War-time volunteers 
and conscripts (“Hostilities Only”) ratings were medically 
examined and interviewed at Combined Recruiting Centres 
(C.R.C.s). Both their acceptance or rejection and the category Or 
branch to which they were allocated were decided by the naval 
recruiter, who was usually a pensioner chief petty officer or petty 
officer, For certain classes of tradesmen such as electrical mechanics 
trade tests were given. Such specialists as gunnery, torpedo, Radar 
and Asdic (anti-submarine) ratings were nominally chosen after 
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a period at sea on the basis of a recommendation from their 
commanding officer, but, owing to shortages of experienced men 
around 1941, the major Proportion were merely put on training 
courses by the drafting officers at the naval depots or barracks 
without any enquiry into their suitability. Neither the recruiters, 
nor the officers concerned with selection, had any training in inter- 
viewing, and with the increasing complexity of modern equipment 
and the multiplication of specialist categories, they could seldom 
be expected to know much about the jobs for which they were 
selecting. Many of them employed unstandardised spelling a 
mental arithmetic questions as tests of education and intelligence : 
Several training schools had home-made group intelligence tests; 
fortunately, perhaps, little attention was paid to their results. Some 
schools made approaches to trained psychologists for assistance. 
Thus in December, 1940, one of the writers was asked to devise 
selection tests for Asdic ratings, because the quality of men sent for 


training was so poor. His battery (described on p. 231) was applied 
for the rest of the War, and it certai 


all, many hundreds of men made the lengthy journey to training 
Schools in Scotland, only to be rejected without even starting 
training. Moreover, the procedure took no account of the equally 
clamant needs of numerous other naval branches. These defects 


could only be overcome by a centrally organised scheme of 
selection. 


The system which work 
Navy could barely cope wit 
in 1941 the Navy was in`the fortunate 


: again by 1944, Not only were men 
€cessary mechanical or other aptitudes breaking the 
еагїз of the training staffs, but also many with excellent quali- 
atively unskilled work. The effects on 


example was the тесги 
at ? Spell Egypt with a J? You’re 


5 А 
с classic liter who was heard saying to a volunteer : 
illegible for the N; ауу.” !i 
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the morale of recruits in general were as serious as the shortages of 
competent personnel. 

Partly owing to the urgency of these difficulties, partly through 
the initiative of the Consultant in Neuropsychiatry, a committee 
was appointed by the Admiralty in March, 1941, to study the 
selection and allocation of Hostilities Only recruits. A. Rodger 
(1945) has described the visits paid by the committee to combined 
recruiting centres, entry establishments, depots, training schools, 
and ships of the Fleet; also how its recommendations were 
accepted in the main, the Senior Psychologist's (S.P.) Department 
being established under the Second Sea Lord, and new schemes 
instituted at recruiting centres and entry establishments before the 
end of the year. 


The Staff of the Senior Psychologist's Department 

К Тһе backbone’ of the psychological staff consisted of eight 
Civilian psychologists provided by the National Institute of Indus- 
trial Psychology. Five other men with psychological qualifications 
Joined later, either as civilians or as R.N.V.R. officers, and two 
trained psychiatric social workers served throughout most of the 
War. But the bulk of the personnel had little or no psychological 
training at the start. At the peak period they numbered about 300, 
including 50 Wren first, second or third officers (personnel selec- 
tion officers), 120 Wren petty officers or C.P.0.’s (recruiting 
assistants), and some 30 Wrens or Marines engaged on clerical 
duties, 15 sick berth attendants (assistants to neuropsychiatrists), 
and about 20 civilian clerks and statistical assistants (cf. Expert 

ommittee, 1947). ` : 

_ The Wren recruiting assistants were m. 
civil life, where they had been teachers, employment officers, 
Social workers and the like. Though only a small proportion were 
University graduates, they were nearly all of university calibre and 
Were extremely carefully chosen for the work in view. They 
received a fortnight’s intensive training in their future duties, 1n 
Which the Director of Naval Recruiting and the бошо Ж 

europsychiatry and Ophthalmology co-operated. They had 
acquire skill in interviewing, testing technique and record-keeping, 
and to obtain some acquaintance with the educational systems о 

Titain and with the nature of civilian occupations. Their wor k was 

‘Tegularly supervised by the psychiatric social workers. 


ostly drawn direct from 
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From their number the most suitable were later chosen for 
promotion to personnel selection officer (P.S.O.). Though they Bag 
by now gained a good deal of relevant experience, they impo 
much additional training in interviewing and in knowledge o 
civilian and naval jobs. For it was they who had to decide whether 
a man claiming to be a fitter, a lathe-setter, a radio mechanic, etc. 
was properly qualified; or whether a recruit would most suitably 
adapt to training as electrical mechanic, stoker, telegraphist, sea- 
шап, writer, or cook and so on; and their employment recom- 
mendations were almost always accepted. Their training consisted 
mainly in an apprenticeship to an industrial psychologist who was 
already carrying out actual selection, but included visits to special- 
ist schools, conferences and refresher courses at the Admiralty, 
often a period in a Government training centre to learn about the 
handling of bench and machine tools, and spells at the Admiralty 
to gain acquaintance with the administrative system and recording 
arrangements. 


Certain points deserve comment, F irst, the Senior Psychologist 
—the di 


all the policy and administration 
were decided and carried out by psychologists. Although psycho- 
logists as such did not at that time receive professional recognition 
in the Civil Service, as Scientists or doctors did, yet there was 4 
Strong scientific atmosphere in the department and an insistence 


Secondly, Most of the Staff, 
logists, worked in the field. 


e and foolproof as po: 


E 


ssible in order to hand them 
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over to P.S.O.s or other less skilled personnel, so that he might be 
free to turn his attention to yet another problem (cf. Straker, 1944). 
The same principle is highly appropriate in peace-time industry 
and education where, as shown below (р. 100), industrialists and 
teachers will need to be trained to carry out most selection or other 
psychological work. 

The fact that nearly 90 per cent. of the personnel were women 
was due primarily to the non-availability of men of similar intel- 
lectual calibre. But the success with which they acquired know- 
ledge of male occupations, and handled thousands of individual 
recruits or tested large groups, and became accepted as useful 
members of the staffs of almost all-male establishments, was very 
striking. It is more because of the reduction in the size of the 
W.R.N.S. than because of any inability of women to carry out 
selection, that much of the post-war work has been taken over by 
R.N. and R.N.V.R. personnel. This, too, carries a lesson for peace- 


time selection and guidance. 


The Recruiting Centre Scheme _ 

The function of Wren recruiting assistants was to supply the 
recruiters with factual information on the basis of which they could 
More effectively decide the suitability or otherwise of candidates 
for the Royal Navy. This they did by applying: . 

(1) A biographical questionnaire, designed to bri 

tional and occupational history, leisure interes 
ence of leadership. 

(2) The Progressive Matrices test, S.P. Test 0. ; 

(3) Selected plates from the ‘American Optical Company's 

version of the Ishihara and Stilling colour blindness tests. 

(4) A short interview, mainly for purposes of clarifying written 

questionnaire responses, but usually including also a ге, 
of informal questions about nervousness, dieting and the like 
for detecting recruits who ought to be referred for psychiatric 


examination. to head- 
Careful records were kept and weekly returns Sent 10 
quarters, The questionnaires and test results of accepted can- 
didates were forwarded to the recruits’ entry establishments: 
Although no fixed pass-mark was set on the intelligence test, к 
men from the bottom 10 per cent. of the population were Mire 
Unless, in the opinion of the recruiter, they possessed other 


ng out educa- 
ts and experi- 
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the majoríty of recruits to approximately the median for the € 
population. An analysis of returns and reasons for rejection over t 


942, showed that out of every 100 


74-4 were accepted 


13-4 were rejected on medical grounds 
47 were rejected because 


B. athematics, С; Spelling, D. Abstraction. ^ 
E. Advanced arithmetic, 


Entry Establishment Scheme 


rity of Н.О, Tecruits were sent to one of four 
ts to receive th 


‘The 

In 1941 the majo 
entry €stablishmen; 
manship, In each о; 


pent a fortnight PT 
raming, in “kitting up”, and in medica 
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and psychological examinations. In addition to one resident and 
two or more visiting psychologists, there were 15 P.S.O.s to deal 
with some 1,200 men a week, and 25 Wren ratings who were 
engaged mainly on test correcting and records. The resident 
psychologist was either a civilian or a R.N.V.R. officer. Also work- 
ind with his department were a psychiatrist and a R.N. executive 
officer, whose respective functions were to interview men referred 
by P.S.O.s as possible neurotics or as potential officers. "here 
were six main features of the procedure in H.M.S. Royal Arthur, 
namely: 
1. Provision of Information.—Posters, photographs, and bulletins 
г describing the work, rates of pay, prospects and necessary qualifi- 
cations, were displayed for incoming recruits to study and discuss. 
A preliminary talk by the psychologist or a P.S.O. both described 
the selection procedure, and drew attention to the various naval 
jobs, emphasising that it was up to every recruit to do the most 
difficult job of which he was capable, and not merely one which he 
thought would give him a pleasant time. Films and trade test pieces 
were also sometimes demonstrated. i 
2. More Thorough Testing —The standard naval battery of four 
pencil-and-paper group tests was given: 
Test 1. Modified Shipley Abstraction. 
2. Modified Bennett Mechanical Comprehension. 
3a. Arithmetic. 3b. Mathematics. 
4, Squares Test of Spatial Judgment. ; 
The sum of scores on these four (with Test 1 doubled) gave the 
total score, known as T2, which was found to be the most useful 
index of all-round potentiality in the Navy. A test of dictation or 
spelling, and a mechanical + electrical information test, were 
generally applied, and when time allowed various tests were given _ 
experimental trials. No tests involving apparatus were feasible with 
such large numbers. : 
3. Employment Interviewing.—Yhough each recruit had received 
a tentative allocation at his recruiting centre, this could often be 
revised at the entry establishment in the light of the additional 
information collected, and of the Navy's current requirements. He 
therefore received a 10 to 30 minute interview with a P.S.O., lead- 
ing up to a final employment recommendation. The interview was 
based on his original questionnaire, to which the results of the 
newly-taken tests had been added. Standardised oral tests of 
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fitting knowledge, machine tools, sheet metal work, petrol engines, 
diesel engines, and electrics, were applied in suitable cases, and 
other questions designed to elicit trade experience were often based 
on trade test pieces (cf. p. 149). Individual performance tests— 
Cube Construction or revised Kohs Blocks—were occasionally 
given. P.S.O.s possessed up-to-date information on the quotas for 
different branches and on job requirements. It should be noted 


that the procedure was as much one of guidance as of selection, and 
that the eventual choice, 


-O.s sometimes interviewed more than 30 men a day. 
.*. Matching Supply and Demand. —As indicated above, selec- _ 


, Since it almost inevitably 


eading seamen, petty officers, and most 
tive officers would eventually be drawn. : 

n in the Forces was certainly more difficult than 17 
stantly changing demands: 


R.N.V.R. execu 
Selectio: 
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courses, were returned to H.M.S. Royal Arthur for re-allocation. 
In 1944 a scheme was instituted whereby the training schools had 
to supply details as to the reasons for failure. This was particularly 
useful since, when the failure was attributable to mistaken judg- 
ment on the part of a P.S.O., she could investigate the case herself 
and profit thereby. The morale of these failed men was often very 
low, hence the employment interview served the valuable function 
of re-establishing their self-respect, as well as discovering a more 
suitable job to which they could be transferred. 

6. Training and Research.—Lastly, H.M.S. Royal Arthur served 
as a training ground for newly-promoted P.S.O.s, for neuro- 
psychiatrists’ assistants, and other selection personnel. Moreover, 
it provided ample opportunities for applying additional tests or for 
other investigations, whereas previously psychologists had scarcely 
been able to obtain more than one hour for testing. 


Other Aspects of Naval Selection 


Collaboration with N europsychiatrists Though there were many 
instances of fruitful collaboration between industrial psychologists 
and medical psychologists or neuropsychiatrists, the relationships 
of the two groups were less cordial than might have been anti- 
Cipated. Some psychiatrists tended to be suspicious of the ability 
of psychologists to assess temperamental or personality que 
by interviews. Psychologists, on the other hand, were doubtful o 
Psychiatrists’ knowledge of jobs, and did not wish to become үе 
ey associated, in the eyes of the Navy, with psychoanalysis an 
abnormal psychology. : , 

Although tt as oye in joint investigations, that dass 
Picked by recruiting assistant Wrens as unstable were well wort 
referring to psychiatrists (cf. р. 157), various practical ачы 
Prevented such referrals from actually taking place. In ШМ Р 
Royal Arthur, however, some ‘7 per cent. of men CRM b 
Neurotic tendencies by the P.S.O.s were seen by the res! a 
Psychiatrist, and 0-7 per cent. were recommended by him tor 
discharge, He in his turn taught the p.S.0.s what to look for. 1 

In their work at naval mental hospitals and clinics the neuro 
PSychiatrists were provided with clinical assistants who Des 
trained to apply portions of the Wechsler-Bellevue adult o Е 
Bence scale, and the Trist-Misselbrook revision of Kohs Blocks, 
to individual patients. 

P.S.—2 
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Work at Depots and Branches—One or more P.S.O.s was 
attached to each of the larger naval holding and drafting centres, 
where they assisted in the re-allocation of recruits returned from 
sea. Many men who had entered the Navy before the S.P. depart- 
ment was established were thus tested and interviewed. The choice 

. of seamen for training in gunnery, torpedo, Radar or Asdic was 
made at the depots, and, though first priority was always given to 
men who had been recommended at sea, the P.S.O.s helped con- 
siderably in raising and maintaining the quality of other trainees. 
In practice, almost the whole of the upper half of the naval popu- 
lation (i.e. men with good education and work record, not merely 
good intelligence) were recommended for some form of specialist 
training. The same P.S.O.s were largely responsible for collecting 
from specialist schools the results of training courses, upon which 
most of the follow-up investigations were based. 

Officer Selection —Experimental application of tests and psycho- 
logical interviews in H.M.S. King Alfred —the training establish- 
ment for R.N.V.R. cadets—gave promising results in 1942. After 
study of the War Office Selection Board procedures (cf. Chapter 
IV), a modified scheme was introduced in the following year which 
spread out the selection process over a much longer period. Many 
possible candidates were ear-marked by P.S.O.s in entry establish- 
ments, and these were further interviewed by a psychologist and 
a R.N. executive officer. Those picked out underwent their first 
naval training in a new establishment, to which a psychologist and 
a testing officer familiar with army methods were attached. Some 
of the “leaderless group” and other army tests, adapted to naval 
conditions, were applied, but the Board’s eventual decision as to 
whether the men should go forward as “C.W. candidates" was 
based mainly on observation of their behaviour during three 
months’ strenuous training: Though there was no psychiatric 
interview, the psychologist studied each man intensively and 
applied certain projection and other personality tests. 

“New-type” Boards were later introduced for candidates from the 
Fleet, and for “Special Entry” cadets. "The candidates usually spent 
a week under observation, and were tested and interviewed. Thus 
the Admiralty Selection Board received a large amount of informa- 
tion on which to base its decisions. Work was also done in selection 
for engineering commissions, and with officers needed for fighter- 
direction. Some 10,000 candidates in all were covered during the war. 
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Although the standard tests, making up T2, differentiated well 
at the officer level, several other more difficult tests were tried out, 
the results with which are described in Chapter XIII. 

Miscellaneous Fields of Selection —Fleet Air Arm pilots and 
observers were selected and trained by the R.A.F. But telegraphist 
air-gunners mostly passed through H.M.S. Royal Arthur, and all 
the various types of air fitters and air mechanics (electrical, air- 
frames, engine and ordnance) were handled by S.P. department, 
constituting indeed one of its major—and most successfully accom- 
plished—tasks (cf. p. 121). Selection and re-allocation were also 
applied on a large scale in the Royal Marines. The transference of 
large groups of surplus R.A.F. and army recruits into the Navy, 
and vice versa, raised serious problems o£ morale. Talks by psycho- 
logists and careful individual selection helped considerably to 
lessen the recruits’ sense of frustration. Some help was given, 
mainly since the close of the war, in allocating Special Service 
recruits and Continuous Service boys. Among boys being trained 
as electrical, engine room, ordnance or air artificers (who had been 
selected by educational examinations) it was shown that a battery 
of mechanical and intelligence tests was of value both in the initial 
acceptance and in the allocation to the different trades. 

Records and Headquarters’ Activities —During the war some 
350,000 recruits passed through the selection procedure outlined 
above. All questionnaires were eventually housed in a central file, 
though a copy of the main items was first made and attached to 
each recruit’s Service documents. Weekly returns of all test 
results and allocations were sent in. The filing system was SO 
arranged that details on every recruit examined could be looked up 
at once, and it was usual for P.S.O.s in depots or other establish- 
ments to telephone the names and official numbers of individuals 
with whom they had to deal and to obtain the information forth- 
with. Two great advantages of this system were, first, that it was 
unnecessary to re-test recruits since first scores could always be 
traced, and, secondly, that it made possible extensive follow-up. 
The final course marks of, say, radio mechanics could readily be 
compared with the selection data collected one-and-a-half years 
earlier. Frequent returns from the specialist schools were also 
analysed at headquarters in order to provide a check on the quality 
of the material which was being sent to them, or on any changes 
in their requirements which might affect selection procedure. 
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The other main activities of headquarters staff included: 


(i) 


(iv) 


(3) О 


(4) 
(5) 


The preparation and dispatch of questionnaire forms, test 
material, information bulletins, posters, etc. 


requirements, pay, promotion, etc., were better investigated 
at the Admiralty. 

Liaison with other departments, such as the Directors of 
Naval Recruiting, Naval Training, Personal and Medical 
Services, and the Education Department. 


Success of trainees obtained from various sources, with 
varying previous experience and background. 

Propaganda for attracting suitable candidates. 

*ganisation of the syllabus: working hours, distribution of 
Periods, size of classes, etc, 

Effects of conditions of work and training on morale. 


F unctions of theoretical instruction and its integration with 
Practical training. 


Use of visual aids including diagrams, charts, films and 
demonstration models, 
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(7) Methods of examining and marking. 

_Attempts were made also to consider how far the “training as 
given,” and examinations at the end of training, correlated with the 
“job as done,” since it appeared in some branches that undue 
stress was laid on knowledge or skills which would seldom be 
required at sea (cf. p. 108). Probably most of the psychologist's 
recommendations did not carry much weight at the time. Never- 
theless a large measure of goodwill was built up, and by the end of 
the war there was a readiness to consult psychologists in the early 
stages of planning instruction or of producing training devices, 
which would have been unthinkable five years earlier. Though 
psychologists were members of Admiralty committees on training 
apparatus and training aids, there was little opportunity for con- 
trolled experimentation. One large-scale investigation, however, 
demonstrated the value of film and filmstrip in the elementary 
instruction of seamen (Vernon, 19462). Little use was made by 
naval schools of objective examinations until, towards the end of 
the war, American practice and the advice of British psychologists 
began to have some influence. A simplified manual of “new-type” 
examining was drawn up recently. 

Another field where there was scope for psychological advice 
was documentation. Even before the war the N.LI.P. had been 
consulted when the form for reporting on the efficiency of naval 
Officers was undergoing revision. Recently a survey on behalf of 
the Director of Naval Recruiting showed how the paper work 
involved in the entry of recruits to the Navy (forms, records and 
returns) might be approximately halved. 

During job analysis and training investigations many instances 
of bad layout or design of equipment were noted. Although such 
matters have largely concerned psychologists in industry, there 
was no ready means of bringing the psychological viewpoint to the 
attention of the naval scientists and engineers responsible for the 
designs. However, the Royal Naval Personnel Research Committee 
of the Medical Research Council did useful work in bringing 
together medical specialists and representatives of Admiralty 
departments dealing with training, clothing, gunnery, scientific 
research, naval construction, and others. Under its auspices investi- 
gations were made not only into equipment, but also into working 
and living conditions in submarines and the small craft of coastal 


forces, in the Arctic and in the Tropics (Critchley, 1947); into 


that the work of vocational psychologists 
ated for a large majority to favour the 


CHAPTER UI 
OTHER RANK SELECTION IN THE ARMY AND A.T.S. 


Abstract.—Early experiments in the testing of Army and А.Т.5. 
recruits achieved only a limited success. In 1941 the Directorate 
for Selection of Personnel (D.S.P.) was established, with powers to 
test, interview and make employment recommendations before 
recruits were assigned to their Arms of the Service. The staff of the 
directorate was almost wholly military, and certain difficulties 
arising from the restriction of the functions of qualified psycho- 
logists are pointed out. The work of selection was carried out by 
personnel selection officers and sergeant testers during the recruits’ 
primary training. This consideration of each individual’s abilities 
and interests at an early stage in his army career had excellent 
effects upon morale. The testing, interviewing and other pro- 
cedures, and the ear-marking of potential officers and tradesmen, 
are described, together with the corresponding methods in the 
A.T.S. Much work was done in the re-allocation of converted 
units, of misfits, and other groups at home and overseas. A detailed 
analysis of Army and A.T.S. employments was made in 1941-2 
and kept up to date. The directorate was not concerned with appli- 
Cations of psychology outside selection, but training, motion study 
and other investigations conducted independently are briefly 
mentioned. 
ss SS 

The Army had even greater difficulties than the Royal Navy or 
R.A.F. in meeting its skilled manpower requirements, since it 
attracted a smaller proportion of men of high intelligence and 
education, and, being unable to reject any except the medically 
unfit at recruiting centres, it had to take in a considerable propor- 
tion of very dull men. Recruits were posted on call-up by the 
Ministry of Labour straight to some Corps or Arm, and there was 
no ready means for subsequent re-allocation of the unsuitables. 
The complexity of modern war has greatly increased the need for 
specialists, and the more technical Arms in particular failed to get 
enough tradesmen or trainable men. Moreover, as the Beveridge 
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enquiry showed in 1941, many highly skilled men were engaged 
on semi-skilled or unskilled jobs. The effects on instruction, and 
on the morale of misplaced soldiers, were extremely serious. It 
was noted, for example, that more neurotic breakdown occurred 
among recruits under training than in battle, and unsuitability of 
employment appeared to be one of the factors responsible (cf. 
Sutherland and Fitzpatrick, 1945). 

In 1939 the Army Council agreed to have an intelligence test 
given experimentally in military training units, under the technical 
direction of a small group of civilian psychologists, headed by E. 
Farmer. They devised a 20-minute omnibus verbal and mechanical 
comprehension test known as F.H.3 (later F.H.R.), which for the 
most part had to be applied by regimental officers who were 
unskilled in testing. Nominal rolls of low scorers were drawn up 
to which the Army psychiatrists occasionally had access. But as 
there was still no effective provision for transfer of men to other 
Arms, or for the discharge of mentally defective or unstable 
recruits, very little was accomplished. 

An alternative scheme, put forward by A. Rodger early in 1941, 
with the help of G. R. Hargreaves, was the foundation of more 
fruitful developments. Hargreaves, an Army psychiatrist, had 
already obtained promising results with the Progressive Matrices 
test among R.A.M.C. personnel, and he influenced General Sir 
Ronald Adam, whose personal interest and energy, when he 
became Adjutant-General, played a major part in what followed. 
Eventually an advisory committee of psychologists was appointed, 
consisting of Myers, Drever, Burt and Philpott. Their main recom- 
mendations were that a new directorate for selection of personnel 
be set up, that an intelligence test be given at recruiting centres on 
medical examination, and that more detailed psychological and 
psychiatric examination be carried out at mobilisation depots, 
before recruits were allocated to any Arm (cf. Myers, 1942-3). 
These proposals were implemented in July, 1941, though it was 
not until a year later that the third one—namely, the General 
Service scheme—could be put into effect, Psychological staff had 
to be collected, personnel to carry out the programme had to be 
trained, a job analysis made of the main Army and A.T.S. employ- 
ments, and suitable tests and procedures devised, before effective 
selection could be instituted (cf. Tuck, 1946; Directorate for 
Selection of Personnel, 1947). 
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The Staff of the Directorate 

All executive functions were vested in the hands of Army 
officers, partly because of the small numbers of trained psycho- 
logists available, but mainly because it was thought that only a 
scheme run by soldiers would be acceptable to soldiers. The 
director, a brigadier, had to keep in intimate touch with the con- 
stantly changing requirements of the Army. Under him was a large 
staff of non-technical officers responsible for organisation, admin- 
istration, supplies, etc., and all the day-to-day application of 
selection was carried out by personnel selection officers (mostly 
regimental Majors or captains), presidents (colonels) and military 
testing officers at selection boards, and testing assistants (sergeants) 
who were given a small amount of psychological training. Psycho- 
logists, all but two of whom were put into uniform, were confined 
to planning and inspection of selection procedure, training the 
non-technical staff and carrying out research and development 
Investigations. At the peak period there were 19 psychologists 
(including 5 women) and 31 officers or promoted sergeants with 
Some training (5 women), together with nearly 600 non-technical 
officers and 700 N.C.O.s (of whom about 50 and 200 respectively 
Were women). In addition a few qualified personnel were attached 
to the Directorate of Biological Research, the Department of the 
Scientific Adviser to the Army Council, and the Director-General 
of Army Medical Services (cf. Expert Committee, 1947). 

Most of the trained psychologists were brought in from the 
universities; but the chief psychologists to the Army and A.T.S. 
and four others were past or present members of the N.LLP. staff. 
Р.5.0.5, M.T.O.s and sergeant testers were volunteers from the 
Army, chosen in the first place for good intelligence scores, 
televant past experience, and apparently suitable ‘personalities. 
Sergeants had two or three weeks’ training which dealt with the 
nature of the tests, their application and scoring. P.S.O.s received 
a month's training in principles of vocational selection, job 
analysis, assessment of educational level and occupational experi- 
ence, Army requirements and documentation, the nature and use 
of tests and interviewing technique. A stiff written and practical 
examination was set before they were finally accepted, the former 
Covering knowledge of procedures. The latter consisted of inter- 
viewing “stooges,” obtaining information from them and making 
employment recommendations. Little or no instruction was given 
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in general psychology as such, though reading was encouraged. 
Headquarters administrative staff and board presidents had no 
systematic training (the higher the rank and the greater the 
responsibility, the more difficult it became to demand technical 
qualifications). Nevertheless, many of them voluntarily undertook 
considerable study of the psychological background of their work 
and of the techniques they had to apply. 

While the huge size of the task to be done in the Army necessi- 
tated a large non-technical staff, it seems doubtful, judging by the 
experience of the Navy, whether quite so much timidity about the 
reactions of the Army to psychology was justified. Without 
belittling in any way the very fine work done by all types and 
grades of staff, it must be admitted that the organisation did not 
always function smoothly. Psychologists suffered considerable 
frustration. They were commonly of lower rank than their non- 
technical colleagues. Policies which they advocated as scientifically 
sound were often rejected, and the methods they devised were 
often misapplied and misinterpreted by insufficiently trained 
personnel. Their training had perhaps predisposed them to seek 
what was best for the interests of the individual soldiers with 
whom they had to deal, and they were less experienced in envisag- 
ing the broader needs of the Army as a whole. Again, the fact 
that they were immured in headquarters (apart from occasional 
visits for inspection or for carrying out experimental investigations) 
tended to widen the gulf between the technical and the practical 
aspects of selection. The lot of the sergeant testers was particularly 
deplorable. Many were highly intelligent teachers and university 
graduates, but, except for a few brought to headquarters as 
research assistants and statisticians, they were restricted to routine 
application and scoring of tests under the command of P.S.O.s 
whose educational and psychological qualifications were sometimes 
inferior to their own, and were liable to be put on to cutting the 
grass or other duties at the whim of any С.О. A quarterto athirdof 
them eventually achieved commissions by the same route as other 
Army recruits, but not on grounds of technical competence at 
their work. Presumably the lesson to be drawn is that psycholo- 
gists cannot expect a complex institution like the Army to accept 
novel procedures merely ой scientific grounds, that gradual edu- 
cation and infiltration rather than the imposition of technically 
valid methods are needed. 
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The General Service Scheme 
'The introduction of the Matrices test, given by sergeants at 
recruiting centres, allowed a better distribution of the available 
good-quality material among the Arms that needed it most, 
although no recruits were rejected by the test. When the General 
Service scheme came into operation, recruiting-centre testing was 
abandoned (except for regular volunteers). During the winter and 
spring of 1941-2 there was intensive work on an Army job analysis, 
the preparation of new tests and experimental trials of the full- 
scale procedure. Under the new scheme all recruits were enlisted 
into the General Service Corps and sent to one of the numerous 
Primary training centres (P.T.C.s) up and down the country for 
Six weeks, during which selection took place and a common 
syllabus of initial training was given. Some 160 P.S.O.s and 500 
sergeants dealt with the fortnightly intakes which normally 
included some 12,000 men, but occasionally rose to double this 
number. The following tests were given: 
Progressive Matrices: 
Test 2. Bennett Mechanical Comprehension. 
Test 3A. Army Arithmetic and Mathematics. 
Test 4. Squares test of Spatial Judgment (dropped in 1943). 
Test 25. Verbal test (initially Test 17. Messages). ^ 
Test 16. Physical Agility. р 
Additional confirmatory tests were applied to those considered 
Suitable for certain types of employment, namely: 
Test 12 or 21, Clerical or Instructions (later introduced into 
the general battery in place of Squares). 
Test 8. Assembly test of Mechanical Ability. 
Tests 10 and 19. Morse Aptitude and Auditory Acuity. « 
Scores on each test were expressed as Selection Grades or S.G.s 
(cf. p. 177). After follow-up investigations had revealed the value 
of the Matrices, Bennett, Arithmetic, Verbal and Instructions 
tests in most Army jobs, the S.G.s on these tests were summed to 
yield the measure of general intelligence known as Summed S.G. 
A qualification form (cf. p. 132), similar to the naval biographical 
questionnaire, was filled in under supervision of the tester. This 
was used as the basis for the interview of each man by the P.S.O., 
in which the details were clarified, the recruit's interests consulted, 
and employment recommendations reached. Most of these recom- 
mendations consisted only of broad categories, both because 
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sorting had to be carried out centrally, and because supply could 
not otherwise be matched with demand for all the tremendous 
variety of Army jobs,. The following list of T.R.s (Training 
Recommendations) was used, after some revision. 

0. Potential tradesmen. - 

1. Drivers. 

2. Mechanical maintenance (later omitted). 

3. Signallers. 

4. Constructional, building, and other heavy duties (originally 

layers and fine operators). 

5a. Clerical and 5b. Storekeeping. 

‚6. Mobile and combatant duties, including riflemen and gun 
numbers. 

7. Orderly duties. 

8. and 9. Pioneer Corps or discharge. 

Three recommendations were given in order of suitability, 
though often a recruit received two identical Т.К.5, e.g. 667, mean- 
ing that both his first and second choices were for straightforward 
combatant jobs. The T.R.s and other information such as age, 
medical category and Matrices (later Summed S.G.) results were 
forwarded to the War Office and entered on punched cards for 
sorting and posting. If the requisite number of, say, drivers for 
R.A.C., infantry and other Arms could not be obtained from men 
with T.R.1 first choices, second choices were drawn upon. The 
postings were then sent back to the P.T.C. within three weeks, and 
the P.S.O.s entered the final T.R., this time with a suggestion 
as to a specific job within the T.R. (e.g. dispatch rider, tank 
driver, etc.). . 

In making their recommendations P.S.O.s had, in addition to 
Army job analyses, lists of minimum standards on the various 
selection tests, medical categories and other qualifications regarded 
as essential for each 'T.R. Some perhaps applied this scheme too 
mechanically, while others were too apt to ignore it and trust their 
own subjective judgments. But the majority certainly attempted to 
integrate all the relevant information about each man, including 
his test results as one important but not overwhelming factor. 
The Р.5.О.з” work and the sergeants' testing and scoring were 
supervised by Command P.S.O.s who were psychologically quali- 
fied officers. Moreover, records of the allocations were forwarded 
to headquarters where the more glaring misplacements or disregard 
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of instructions were checked. The occasional P.S.O.s who were 
clearly incompetent could thus be eliminated from the directorate. 

Some 14 per cent. of recruits were referred to an Army psychia- 
trist working at the P.T.C., including all those of very low 
intelligence, men considered to be poor in “combatant tempera- 
ment," and possible neurotics or psychopaths. Many of these 
were passed as normal, but those with psychiatric disabilities 
were either allocated to appropriate non-combatant employments, 
or to the Pioneer Corps (armed or unarmed), or sent to mental 
hospitals or discharged. By wise placement many emotionally 
unstable men were helped to better adjustment than they showed 
in civil life, and the Army's use of mentally defective and very dull 
men in the Pioneer Corps was an outstanding success. Recruits 
Who as civilians had been drifters or unemployables were found to 
Bive excellent service at simple labouring tasks, when working and 
living with others of a similar level of ability under specially chosen 
officers and N.C.O.s. Their morale was high, and their sickness 
and delinquency rates low. Moreover, the dull, hard jobs of the 
Army were probably better done than they would have been by 
more intelligent recruits (cf. Rees, 1946). 

Potential tradesmen were also identified individually. P.S.O.s 
Were provided with a Guide to Civilian Occupations which helped 
them to assess both the amount of trade experience claimed by 
recruits and its relevance to Army trades. Later, a series of written 
tests of trade knowledge was prepared, each covering a particular 
Army trade. Often the number of skilled, or semi-qualified men 
was insufficient; and the balance was made up from inexperienced 
recruits of high intelligence who were considered to be rapidly 
trainable. A “Short List" was issued each fortnight of the trades 
for which candidates were specially needed: Actually,as shown later 
(p. 120), these men put up by P.S.O.s did distinctly better at trade 
coursesthan the semi-qualified tradesmen submitted by the Ministry 
of Labour, or others put up at their own request, or by their C.O.s. 

Р.5.О.з also had the task of assessing good or poor employment 
record, very strong or weak combatancy ог aggressiveness (some 
5 per cent. of men were usually placed in each extreme category), 
and suitability as a potential officer. Those in the latter group were 
automatically sent to War Office selection boards after six months 
training, along with other candidates recommended from their 


units, and, in fact, provided the major source of officer material 
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in the second half of the war. When strong experimental evidence 
had accrued as to the value of certain of the standard tests in 
selecting successful candidates, P.S.O.s were instructed to ‘take 
particular account of.the combined score on these tests*. Many 
however, preferred to rely on their own hunches regarding the 
officer-like qualities of their interviewees. 

Between 1942-5 some 700,000 recruits passed through the G.S. 
Corps of whom 9 per cent; were chosen as potential tradesmen and 
6 per cent. marked as potential officers. Each man's qualification 
form, with test results and 'T.R.s entered, was attached to his 
Service documents, but a copy of the main items was entered on a 
card and stored in headquarters for research purposes. Unlike the 
naval system, there was no easy means of disinterring the selection 
information about any given individual, and when recruits came 
up for later re-allocation, their original scores were seldom avail- 
able. Many men went through the same battery two or more times 
—a proceeding whose bad effects were only partially overcome by 
the provision of retest norms. Not until the end of the war was it 
possible to prepare parallel versions of the chief tests. 

Although less stress was laid in the Army than in the Navy on 
self-guidance, P.S.O.s gave an introductory talk to each batch of 
recruits and spent part of their interviews providing information 
on thé types of employment likely to suit them. 

In conclusion, selection procedure on so large a scale had its 
inevitable weaknesses, but there is no doubt that the fair and 
thorough consideration given to each individual had a most favour- 
able effect on morale. Even apart from the saving in misplaced 
manpower, of which evidence is given below, this feature provided 
justification for its introduction into the Army. 


Initial Selection in the A.T.S. 


Selection procedure in the А.Т.5. which is fully described in 
an article by Mercer (1945), differed little from that in the Army 
and was run under the same directorate. By 1942, with the intro- 
duction of compulsory national service for women, the numbers 
in the Service reached over 200,000, and there were more than 
a hundred different employments. Some of these could be filled 


The combination Was score on Instructions-+ Bennett, minus the score оп 
Arithmetic+ Matrices. This closely duplicated the weightings obtained from the 


multiple regression equation (cf. . 104) b th .S. test battery an 
W.O.S.B. Pass or Fail. et P paeem die, G $ 
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directly by recruits engaged in similar civilian jobs, but there were 
many with no civilian counterpart, such as instrument numbers 
(anti-aircraft personnel), wireless operators, tinsmiths and fitters. 
Again many different types of work might be involved in jobs 
called by the same name; for example, there were eighteen kinds of 
clerk. One of the first steps, therefore, as in the Army, was a job 
analysis of the main varieties of employment, special attention 
being paid to the conditions under which the work would be done 
(sedentary, outdoor, ‘dirty, long hours, etc.), and the degree of 
individual responsibility involved, as well as to the capacities and 
training needed. 
At the beginning of the war most of the allocation was done by 
the commandants of the basic training centres who often had to 
deal as best they could with, perhaps, 250 recruits in an afternoon. 
Early in 1941 an experimental battery of sensory-motor and per- 
ceptual tests, devised by the Cambridge Psychological Laboratory, 
was instituted for the selection of A.A. instrument numbers, This 
was taken over by the directorate, but Jater found to be less reliable 
and valid than ordinary paper-and-pencil tests (cf. р. 248). The 
Matrices test was given at recruiting centres by A.T.S. sergeant 
testers from 1942 onwards, and between 5 per cent. and 25 per 
cent. of the lowest scorers were rejected, depending on the number . 
and level of recruits required. It should be noted that there was 
“no temptation to fail this test in order to escape conscription. Most 
recruits were volunteers, and all the others had chosen the A.T.S. 
Іп preference to some alternative form of service. There was no 
interviewing or allocation at recruiting centres. ` 
‚ At the A.T.S. basic training centres, the procedure was Very 
Similar to that in Army P.T.C.s, except that, with the smaller 
numbers, recommendations for particular jobs could be made 
Instead of for general types of duty. Usually two or three sugges- 
tions were listed, which enabled supply to be matched with 


demand. P.S.O.s in the A.T.S. were chosen to be as tactful and 
sympathetic in manner as possible, since many personal problems 
(womien) psychia- 


were apt to be raised by recruits. Referrals to ( ү 
trists were of value, but were less frequent than in the Army, bo 
because of the exclusion of low Matrices scorers and because undue 


lack of combatancy did not need to be catered for. A large amount 


of information on jobs was made available to recruits by means of 
f the Agility 


photographs, films and talks. With the exception Oo 
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test, almost the same battery of tests was used, but a simple form 
of computation-checking test was substituted for the Arithmetic 
3A test (cf. p. 229), a spelling test was added, and standardised 
stenography and typewriting tests were available for those 
claiming clerical experience. Nearly 40,000 recruits passed 
through this procedure of whom, it was subsequently found, 
94 per cent. were successful in the training to which they 
had been assigned. 


Re-allocation and Conversions 

Probably as much work was done on re-allocation or transference 
of men and women to new jobs as on initial selection. During 
1942-3 some hundred battalions were converted from Infantry to 
other Arms. Later, when the danger of air-raids decreased, most of 
А.Т.5. anti-aircraft employment was closed down. Casualties in 
the armies overseas, or medically down-graded men might need 
new employment. Many units formed before the institution of 
selection procedure contained numbers of misfits who, having 
failed at the first job to which they had been posted, were put on 
to orderly duties or had nóthing to do. Recruits of long service in 
the A.T.S. (enlisted before 1942) could be recommended by their 
С.О. for up-grading to a more skilled and better paid job. Finally, 
Some new need might arise, particularly in overseas units for which 
suitable men had to be rapidly collected from other jobs, as when 
the Army of the Rhine urgently demanded deep-sea divers for 
work at Amsterdam Harbour, and the demand—which would 
normally have taken weeks of correspondence with the units—was 
instantly met by the D.S.P. documentation scheme. 
_ Some of the more straightforward re-sorting could be done by 
Command P.S.0.s or headquarters staff merely through scrutinis- 
ing the qualification forms of recruits who had been through 
General Service, For large-scale schemes, personnel selection 
teams visited the units and applied the usual testing and inter- 
Viewing procedure, Such teams were also set up in Egypt, North 
Africa, Italy, etc., to deal with local problems. In 21st Army Group 
records were collected before D-Day on Findex cards, to facilitate 
selection for special duties and reorganisation or regrouping after 
battle. Every N.C.O. and other rank, in addition to being tested, 
was rated on a number of qualities pertaining to efficiency at a con- 
ference held between a P.S.O. and officers or N.C.O.s who knew 
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the men best. The same technique was of value in disposing of the 
35,000 A.T.S. in A.A. units. 

Perhaps the most noteworthy re-allocation scheme, from the 
psychological angle, was that of the Army and A.T.S. selection 
centres, where misfits, down-graded recruits and the like were 
sent. All were re-tested, medically examined, and very thoroughly 
interviewed by P.S.O.s, anda third to a half were seen by psychia- 
trists. A conference was held between the C.O., the medical 
officer, psychiatrist and P.S.O. at which final decisions Were taken, 
about 30 recruits being covered in 2-23 hours. Those for whom no 
appropriate employment was available could be, and were, dis- 
charged. This system was particularly valuable in raising morale, 
and substantially similar methods were used for dealing with 
escaped and repatriated prisoners of war. Inasingle year (1943-4), 
30,000 to 40,000 men passed through A.S.C.s, and a similar 
number were re-allocated overseas; 75,000 returned prisoners Were 
dealt with during 1945. 


Miscellaneous Psychological Activities е 

The technical staff at headquarters engaged on routine statistics 
and records, development and research, was much larger than in 
the Royal Navy. Some 20 technical officers, P.S.O.s and psycho- 
logists and 50 sergeant assistants were often involved. Test con- 
Struction and follow-up are described in later chapters. Job 
analysis was continually being extended to cover the smaller 
Categories omitted in the first survey, and to allow for any altera- 
tions in the nature of employments. The W.O.S.B.s and А.Т.8. 
had their‘own job analysis teams. "The usual method was for an 
Officer specialising in an employment to collaborate with the 
technical officer. The two visited units to study the recruits at 
their training or on the job. Towards the end of the war, simplified 
editions of analyses of all the jobs in certain Arms were issued 
which aimed to convey to P.S.O.s, by means of diagrams, drawings 
and photographs, as vivid a picture as possible of the work and 
the conditions under which it was done. The minimum standards 
or other qualifications issued for each employment also underwent 
continuous revision. An additional P.S.0. was attached to each 
Arm in order to keep D.S.P. in constant touch with its selection 
problems, and to advise on any changes. : 
One branch of D.S.P. was concerned solely with exceptional 
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recruits who required special employment recommendations. 
Some of these were psychoneurotics; others were men of very high 
intelligence, but with no signs of officer quality or of mechanical 
aptitude—for example, university graduates in languages on 
classics, for whom posts as interpreters, script writers, radio 
announcers, etc., could often be found. Another activity was deal- 
ing with “atrocity stories,” i.e., complaints by senior officers or 
politicians of gross misplacements, or rejections of particular 
recruits. 

Much assistance was given to psychologists in the Allied Forces. 
Thus, the Dutch, Belgian and French Armies set up similar 
selection organisations and translated several of the S.P. tests. 
Several D.S.P. teams were sent abroad to study and carry out 
selection of non-English speaking recruits—Indians, Palestinians, 
East and West Africans, etc. 

Boy tradesmen are recruited for the regular Army, much as for 
the Navy, by academic examinations at about the age of 14, and 
receive some four years’ apprenticeship training. Here, too, the 
value of mechanical tests and of an interview by an industrial 
psychologist were demonstrated (cf. p. 245). 

М The directorate was Specifically established to deal with selec- 
tion. This had the advantage that it could prohibit units from 
applying home-made tests. For example, one mechanic’s branch 


physical development centres or at basic education courses, or of 
membership of Pre-Service Organisations (cf. Chapter XI). But 
training investigations in Particular fell outside the directorate’s 
scope. ' 
\ 
Two memoranda on methods of Army instruction were drawn 
p by Burt and Valentine (1942, 1943) on the basis of replies to 


questionnaires sent to past students who were serving. The main 
defects mentioned were: 
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(4) Too much crowding of instruction; bad organisation of 

courses. А { 

(5) Lack of “learning by doing" ; lack of appeal to interest. 

(6) Lack of visual aids, and of elementary textbooks. 7 

(7) Use of sarcasm or abuse by instructors; unsympathetic 

handling. j dis 

(8) Газбен selection of N.C.O.s suited to instruction; lack 

of training in instructional technique. — 

(9) No homogeneous grouping of classes in ability. 

(10) Haphazard and subjective examination каш ded 

Later, W. Stephenson, who was appointed Consultant · Mr. 
logist to the Army Medical Services, carried out RN it 
studies and other investigations of instruction, recor ы xd 
example, the relative times devoted in specimen periods dem 
Stration, to practice and to oral instruction, and the B Pn 
aids. Several reforms arose out of such enquiries, an e 
of instruction teams" were trained which toured the uni 
Army schools. 

A time and motion study group, headed by Ungecoon (190) 
Was established under the Scientific Adviser to the Army uu > 
which recorded existing drills for gun teams and drew aie xc 
and more efficient methods. For example, the a p» que 
Coastal artillery gun was doubled, the time for un P P de E 
and trailer from a glider was reduced from ten to Bd Ne 
an improved scheme of handling ammunition 1D арар which’ xere 
out. Other problems with psychological bearings 
tackled by the Medical Research Counc 
Biological Research, or the Directorate of Army 
Mentioned in the Expert Committee s керо Ме аршын 
Шаа Gonign endl layout on едш Ж o instion, effects 
controls, effects of poor ventilation on motor E Ex 
ОЁ certain drugs, and morale and disciplinary 5 2 


CHAPTER IV 
ARMY OFFICER SELECTION 


Abstract—The old-type interview boards for the selection of 
officers showed considerable shortcomings by 1941, and new P 
Office Selection Boards were set up, each staffed by a president, 


- or 
Sergeant testers. These conducted much more thorough = i 
three-day examinations of candidates and—whether their metho 


Psychological viewpoints. Psychologists were chiefly confined e 
training, research, and the application of intelligence and pr Иза 
tion tests to candidates. The diagnosis of the character qualitie 


exercises. These were designed not so much to reveal officer tr x 
(e.g. leadership) or abilities, as to bring out the candidates’ socia 
reactions under conditions of strain. 
Scientific value of these diagnostic methods, but they MR. 
have important bearings on the selection of executives, civi 
Servants, and other high-grade Personnel in peace-time. 


In the first two years of the war, 
officers was based on recommendations 
half-hour interviews of the candidates 
The system worked fairly effectively so long as there was a large 
supply of good material, e.g. from the Public schools. But when 
this source began to dry up, the boards, being faced with recruits 
unds were entirely unfamiliar, 
ively. Unsuitable candidates 
Were often passed anq Sent to O.C. T.U. (Officer Cadet Training 

i » With unfortunate effects 
ver, SO many candidates 


the selection of temporary 
from C.O.s and quarter- to 
by boards of senior officers. 
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who might have succeeded were rejected by the boards, often— 
Ero ding to their own accounts—on flimsy grounds such as 
rammar School education or socialist opinions, that recruits lost 
confidence in the system, and there was a real danger of insufficient 
officers being forthcoming. Again, there was less opportunity than 
in 1914-18 for selection on the basis of performance in battle. 
Early in 1941 experiments were carried out by two psychiatrists 
attached to Scottish Command, encouraged by the G.O.C. Sir 
rer 'Thorne, who had previously been military attaché at 
es is and had observed some of the elaborate selection techniques 
OR. oped by German military psychologists (cf. Chapter 1). 
а attending courses at the Edinburgh Company Commanders 
t ool Were given psychiatric interviews and intelligence tests, 
ogether with other tests on the German model. While the latter 
© unpromising results, the correspondence between the psychia- 
Н "e diagnoses and the school's estimates of the officers worth was 
vM striking (cf. p. 160). By 1942 the first experimental “new-type 
Noe Selection Board has been set up. The methods worked 
red ere were adopted by other new boards, a dozen of which were 
Of ed in various parts of the country by October, 1942. A.T.S. 
сег Selection Boards followed in 1943. Later boards, on the 


sd lines, were attached to Armies in the Middle East, India, 
aly and Western Europe. By the end of the war some 140,000 
of whom about 


үне had been through the new procedure, 
0,000 passed. | 
a many popular descriptions have been published that a brief 
fo incor the procedure will suffice (cf. Garforth, 1945; Directorate 
S Selection of Personnel, 1947). Most boards dealt with 120 or 
n 64 candidates a week. Though there were many variations 
tween boards it was usual for candidates to visit for three days. 
e ring this time they filled in a biographical and a medical qued 
th naire, and were given certain intelligence and personality tes 5, 
e latter being of the projection type- A proportion were inter- 
Viewed by a psychiatrist, and all were interviewed by the president 
в * deputy president—senior Army officers. Each batch of зш to 
en was under the charge of a military testing officer (M.T.O.), 
Who messed with them, and did his best to put them аў ease and to 
observe their natural social behaviour. He too applied the series of 
Practical tests such as group discussions and lecturettes, comman 
Situations, obstacle courses and “leaderless group" tests designed 
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to bring out the candidates’ initiative, Co-operativeness, leadership 
and other social qualities. Often a visiting officer, with the presi- 
dent and other members of the board, observed some of these 


ence. At the end of the period a conference was held between all 
members of the board where the judgments of М.Т.О., psychia- 
trist, technical officer if any, and president were compared and 


discussed, and doubtful cases Were considered in detail before the 
President gave his final decision. 
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- patrick (1945) put it, this would have meant introducing «а foreign 


tox уо the tissues of the Army.” Though the aim was to educate 
К imd gradually into accepting scientific methods, the com- 
уса. entually achieved showed considerable technical defects. 
Me the rists, psychologists and M.T.O.s were technical advisers 
wisl president; and each president could run his board as he 
A hed, with as much or as little reference to the technicians as he 
ias subject only to the controlling authority of the Director 
the " е of Personnel, himself a professional soldier. Hence, 
qm ES ens representing the Army, Was responsible for the final 
were ons; hence also a major part was played by M.T.O.s who 
the M a ery officers. This meant considerable dependence on 
gence ые judgment of a single man, and considerable diver- 
RUM etween different boards. The president (or his deputy) did 
jen ope questionnaire on educational, 
DNE istory, and the C.O.’s opinion, as а basi f 
Chi BL but no standard scheme of interviewin 
iefly he attempted to gauge «officer quality" ап 
digg iin into interests and past achievements, and the an 
additi з attitudes towards an officer’s rôles and responsibilities. n 
i OPEM, he tried to assess the relevance of the candidate s quali- 
hi ions to his chosen Arm and could, if he thought fit, recommen 
I$ acceptance for some different Arm. Differences between boards 


Sreatly hampered all subsequent attempts to validate the pro- 
:enificant discrepancies in 


hough these were 


Th acer ably smaller than those found among old-type Я 
Gude in one experiment where strictly comparable groups of 
23 idates were sent to two boards, the respective pass rates We 
do er cent. and 48 per cent. This, however, 
eme case. ^ 
'The Psychologist ; 
A chief technical officers amem of five or six fully qualified 
Psychiatrists and psychologists, some senior M. .O.s, and Б 
trained assistants. They were responsible for developing 2? 
novne the methods used by all the boards, but as their ү 
tons were advisory only, they had very little control. For the mos 
part they were isolated in the research and training centre ( zuo. 
Where they were not given any opportunity; until the end о 
War, to carry out day-to-day selection themselves. They had little 
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contact, also, with the technical staff of D.S.P. working at the War 
Office, since their approach—based mainly on psychopathology 


R.T.C., board psychologists wer 
as assistants to each psychiatris 


е sergeants, three of whom acted 
t; they did not attend the final 


conferences. Another reason for the minor réle of psychology was 


that officer-suitability appeared t 


о be chiefly a matter of character 


and personality, and in the absence of objective tests of the 
desired qualities, interview techniques which psychiatrists them- 
selves had evolved Successfully at Edinburgh constituted the most 


promising approach. 
The sergeants’ functions wer 


metric. They applied and Scored intelligence tests, but also gave 


the projection tests and to some 


these, with a view both to Separating off the candidates on whom 


the psychiatrist could Most pri 


whom he need not interview, and to feeding him with information 


equivalent Scores, the sum of which was then converted into ап 
officer-intelligence rating. This rating ranged from 10 to 0, with 


3 corresponding approximately 


very little stress was laid on a high rating; candidates scoring 10 
were not looked on any more favourably than those scoring, say, 6. 
Several other tests were available for special purposes such as the 
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Particularly in the early days of the Edinburgh Board, though no 

. definitive results were obtained. An assembly test for engineering 
cadets was devised to yield qualitative ratings of mechanical 
Interest and comprehension, methods of work, etc., rather than 
quantitative scores. A detailed information test on workshop рго- 
cedures was also prepared for R.E.M.E. cadets. 

In 1944-6 standardised educational achievement tests were con- 
Structed for such groups as schoolboy candidates for university 
он courses, which were intended to supply potential officers for 
те Arms, and for candidates for Army College courses. 

hese included objective examinations in arithmetic, algebra, 
ed metry, calculus, heat, light, electricity, mechanics, general 
er current affairs in history and in geography. The tests were 

School certificate to higher certificate standard. 1 
aim he first questionnaire has been mentioned above. Its prime 

Was to indicate the candidate’s opportunities and what he ha: 
fen of them. The second asked for various details of medical and 
al history, and was seen only by the psychologists and peu 
in 5. To some extent this acted like the personality inventor 

Which neurotics tend to check large numbers of symptom? (сї. 
Р. 287). Candidates were also required to write two brief descrip- 


Bins of themselves as seen first by a good friend, secondly bya 


Severe critic, Sergeants were trained to extract from these docu- 
те the points of major psychological interest. In the two group 
Projection tests, direct self-reference was intentionally avoided. x 
ly-word free association test was given with the title “Test Y 
AES of Imagination." The stimulus words were pee 
Mite ly for fifteen seconds each, candidates being ieee $o 
Кеп down the first idea that came to mind, either y s 
Oo PS Some of the words were originally on D 
Sui iguous meanings, e.g. “lead,” it being hoped that t p 
table candidates would react to their military meanings. 


5 feature possessed no diagnostic worth. А 
азбу, six of Murray’s Thematic Apperception ae xs 
to = by slides for four minutes each. Candidates were an 

Write short stories describing the social situations sugg 

“tgeants received training, both at R.T.C. and from their m 
to chiatrist, in the interpretation of responses to these projec’ 
S, neither of which was objectively scored- 


On an average 
aa i i jonnaires ап 
fan-hoür was available for reading the question: 
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projection responses and integrating their deductions into a sum- 
mary of the salient personality features, according to a more or less 
standard scheme. Note was made of the frequency of appearance 
of such attitudes as anti-social, disruptive or cohesive, also of 
undue morbidity, or of special types of symbolism. Degree of self- 
insight, breadth of outlook, and extent of identification with the 
Army’s needs were assessed. Admittedly the technique was largely 
intuitive, and, although investigations into reliability and validity 
were planned, it was not possible to complete them. Nevertheless, 
a moderate degree of consistency was found between different 
sergeants judging the same material. Moreover, when a large 
number of psychiatrists and psychologists independently inter- 
preted the pointer material of 38 candidates, the promising correla- 
tion (cf. p. 104) of 0.58 was obtained between their modal judgment 
and the final board grading. 

With the provision of more commissioned psychologists towards 
the end of the war, it was arranged that candidates whose pointers 
were not followed up by psychiatrists, and whose personalities 
appeared less complex, should be interviewed by a psychologist, 
though his province was strictly limited to “surface” traits and 
sentiments, At the end of 1946 the shortage of staff became so acute 
that both psychiatrists and commissioned psychologists were with- 
drawn from the boards altogether. 


The Psychiatrist 

_ Since there was only one psychiatrist at most of the boards, and 
since his interviews lasted from 20 to 60 minutes, the proportion 
of candidates seen had to be limited. But an additional reason for 
this was the manifest desire of Army authorities to reduce his réle 
to a minimum. While their suspicion was natural enough, it was 
certainly misguided. Only those who were ignorant of board pro- 
cedures and of the nature of the discussions at the final conference 
could continue to believe that psychiatrists spend all their time 
smelling out sexual complexes. It is true that in occasional cases 
of superficially well-adjusted Personality the psychiatrist might 
discover psychopathic or neurotic tendencies and recommend 
their rejection. But at least as often he drew attention to underlying 
potentialities and strengths in outwardly unimpressive or diffident 
personalities which other members of the conference had been 
inclined to reject. His services as a purely medical examiner were 
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also sometimes needed. But the main value of the psychiatric 
approach, and the reason for its success in the early experiments 
may, perhaps, be expressed as follows. 

r he boards were concerned to select candidates who would: 

(i) Pass their O.C.T.U. courses, and acquire the necessary 

7 technical proficiency. 

(ii) Stand up well to the stresses of battle, showing resource, 
leadership, aggressiveness and caution when these qualities 

‚ Were needed. 

(iii) Look after his men well and gain their confidence. 

(iv) Co-operate effectively with other officers and make a good 
с соп on his seniors. - : И 
Chief o ner róles might be mentioned, but these d g the 
avail aes With most of the candidates little direct evi a. Y 
aa able, particularly regarding (ii) and (iii), thus the board һа 
A Sag future behaviour, often in young and immature person- 

105, from the indirect indications of past history. Now it 15 the 
medical psychologist—particularly the psychoanalyst—who has 
Уе chiefly responsible for introducing the gaa generin 
ne pted by contemporary psychologists, that our behaviou i 

ver a random or chance reaction to the circumstances of the 
киси, but is always determined by or arises out of that Sn 

9n of innate and acquired tendencies which we call the tota 


Personality, Further, that we are not conscious of the true nature 


© + 
Ё many of the most important of these tendencies. Thus а more 
istics and a more 


Complete explanation of our present character 

жае prediction of future indt is likely to be obtained by 
Xploring the underlying mechanisms, and this the bi 
ү ologist is particularly well fitted by his training to ШИП r 
pud others, such as the normal psychologist or the d 
s 151, are often capable of equally penetrating diagnose". favin Е 
Studied abnormal and disintegrated personalities the psychiatris 


1S quick to recognise the relatively well-balanced and Erb 
Personality, and one of the principles Upon whic dd 
PSychiatrists worked was that it is the harmoniously crt. 


"Пап who is most capable of showing aggressive or cautious, M 
ferential, firm or sympathetic characteristics ghen 1 л 
Бе needed, and is least likely to give way under ү bh 
pull be noted that most of the desirable officer 948 m m 

of appropriate reactions to people; there is little of the job W. 
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will not be carried out in a social context. Medical psychologists 
believe, rightly or wrongly, that adult social reactions are based 
largely on the attitudes to parents and other humans established 
in infancy and early childhood, hence the relevance of their 
enquiries into family history and childhood memories. Among 
A.T.S. officers the social side is if anything more important, since 
they are not usually required to be technical specialists and will 
seldom be subjected to operational stress. Thus the personality 
organisation and attitudes to people desirable in a woman officer 
(which may be very different from those in a man) was the subject 
of special study, largely by women psychiatrists. 


The Military Testing Officer 

The M.T.O. was originally introduced as a “cover plan"; he 
was to apply tests of a military nature which would impress the 
Army while the real job of selection was undertaken by the tech- 
nicians. However, his functions were progressively clarified, and 
their value increased, through the investigations at the research 
and training centre. In the first experimental board he tried 
out some of the practical situation tests used in Germany, and 
improvised others. For example, he took his group of eight 
candidates to a stretch of country and there posed a series of 
tactical or other problems to candidates in turn. Each candidate 
again might be put in charge of the rest so that he could display his 
handling of them in military situations, Obviously, the success of 
candidates at such tests is likely to depend as much on their pre- 
vious training and experience as on their personal resourcefulness 
or other qualities, Group athletic games and elaborate obstacle 
courses were also devised, with a view to bringing out co-operative- 
ness, aggressiveness and “guts,” quick judgment, endurance and 
the like; but physical status and bodily skills probably entered too 
largely. Lecturettes, given with or without preparation, and 
informal discussions were supposed to show fluency, confidence 
or social aplomb, together with acceptable or undesirable attitudes 
towards the topics discussed, e.g. discipline. M.T.O.s were usually 
men with battle experience, whose age was fairly close to that of 
the majority of candidates. Hence their personal experience of the 


they had no systematic instruction in the assessment of personality 
or avoidance of halo. Although many of the tasks appealed to the 
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candidates’ sense of fitness and fairness—also to outside critics— 
they Were quite unstandardised and unscoreable, and their 
diagnostic worth was extremely dubious. 

i the view of the R. T.C. psychologists, the search for particular 
traits characteristic of officers, and for tests to measure these traits 
Was likely to be a waste of time. Influenced by the field theory 
approach of Lewin, Moreno and other writers in America, they 
SN d not accept traits as predominantly constant qualities of an 
individual, existing independently of the context in which they are 
Eo Successful officers do not all show the same traits; thus, - 
Bde d seem to be the total configuration of traits in their p 
qs rather than the individual traits which makes for success. ; 
certaj s that a candidate should not be thought of as possessing | 

"in E amount of leadership which he can display both in ay ane 
Pie. ide situations. His personality is an organised yos Ta 
ү €m of tensions or needs, which interacts dynamically М r 
t pus demands of different situations. “Officer quality 5 d 
will MM be analysed in terms of the main róles that future of is o 
Syste е called on to play. By setting appropriate tasks a si ls 
can m of forces can be set up at the W.O.S.B., whose E z 
candidat be observed by the М.Т.О. or other board Rr E x 
E ate’s most important róle will be that of leader © ae 
d ; and he should be able to uphold his own position S 
its Up, to give the group direction, and at the same time 1 A 
cohesion or solidarity against internal or external disrup 
Orces, 


It was from this theoretical back dt 

groun h 

Con less group tests was evolved in 1943 by Bion tesis wale 

a eed the most original feature of W.O.S.B. proce 251 

is oi erless group test, a group of, say, eight candidates ait pios 

a Blven a task, and left to work out its own solution. No lea s 
Ppointed, and the M.T.O. is an entirely neutral observer. ac 


Candia... | pun 
hae idate is out to distinguish himself, but the task is so designe 


St this is on „In other 
this is only possible through the success of the E Sos 


W . 

и he is forced to submerge himself and ea P E rud 

vides ike Each candidate resolves this conflict 1n. peer: 
Mie. manner. Some become mere “passengers Ais T ii 

Nat Т own desire to shine; others become “thrusters wW Au s 
€ group interests to their own. More effective solutions 


aneously 
cen these extremes, are shown by those who spont 


hat the basic series of 
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attract the confidence of their fellows, and who come to the 
fore although working for the group's success. Thus the reactions 
of candidates to such situations are considered to be diagnostic of 
their behaviour towards their platoons under Active Service con- 
ditions. Bion (1946) writes: “The essence of the technique . . . was 
to provide a framework in which selecting officers, including a 
psychiatrist, could observe a man’s capacity for maintaining 
personal relationships in a situation of strain that tempted him 
to disregard the interests of his -fellows for the sake of 
his own.” 

The tasks were designed to be “real” ones which would bring 
out the candidates’ natural behaviour, not artificial test situations. 
To the candidates or the uninitiated observer they appeared to 
Pose a practical problem, but this, of course, concealed the under- 
lying social problem in which the selection staff were interested. 
The basic series of tests, although run by the M.T.O., came to 
provide а сопипоп meeting place for all members of the board. 
They found that they could more effectively apply their particular 
techniques after this preliminary observation of the candidates’ 
personalities “іп action." The tests provided an approach comple- 
mentary to that of the interviewers. The psychiatrist, for example, 
could link up the abnormal adaptations of candidates to one 
another with the adaptations to family, school and occupational 
environment traced out in his life-history interview. The series 
included such tasks as moving a heavy object over a series 0 
obstacles, or erecting some apparatus, or having a discussion ОГ 
debate on some topic, or planning an administrative scheme—$ay 
à system of training for future recruits. The situations were 50 
arranged as to display the formation of initial social contacts 
between members, their behaviour when co-operating on a com- 
mon task, their reactions to internal disruptive forces (competition 
between members) and to external forces (competition of one 
group with another). 

_ Later, other tasks designed to reproduce such officer rôles 25 
independent command, as instructor, or administrator, and 4$ 
subordinate to senior officers, were posed by the M.T.O. ue 
selecting specialised personnel such as applicants for regular 
Army commissions, colonial administrators, university short 
course candidates, and the like, other appropriate *«work-sample' 
Situations were devised. Sometimes a task extended over three 
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hours, the group having to study certain material, plan its course 
of action, and report. f 
к» ay рк each member studied the candidates inde- 
eumd A the final conference. This led to a somewhat 
оа е, mosphere, since members felt constrained not to 
i con current group even informally, and serious disagree- 
a ED occurred at the conference, when it was too late 
SIE intr e urther investigation of the doubtful candidates. Thus 
jd E y e of collaboration and mutual consultation at all 
the rm билд to have considerable advantages. Often most of 
EE cut ers would recognise certain candidates as clear passes or 
the Tum Y on, and so feel free to concentrate their study on 
са dn ine or controversial cases. The system had, however, 
Bible to rem UNE from the scientific angle, since it became impos- 
nvestigate the reliability or validity of the contributions of 


the ; 
various parts of the board procedure. 


Miscellaneous W.O.S.B. Activities 


Another function primarily carried out by M.T.O.s was the 
covering. the life and 


im . + 

поа опе of preparing job analyses, 

in m of each of the main types of officers to be selected. Later 
е war, R.T.C. and the boards undertook numerous more 


Specialised jobs including: 
(1) Selection of adolescents for eventual training as officers after 
2) d iods of study at universities or technical colleges. - 
iversion of promising candidates who appeared too imma- 
ture for commissions to special courses of training designed 
д to develop “leadership.” ` i 
(3) Selection of experienced personnel for special duties— 
Palestine Mobile Police Force, Civil Affairs officers for 
administration in the Far East, etc. also classification of 
(4 таеп engaged in the psychological warfare branch. 
) Guidance of certain classes of officers into suitable employ- 
ment, including psychiatric disability cases, officers who 
were surplus in their old jobs, officers who had received 


adverse reports, and—in particular —repatriated prisoners 
cer Resettlement 


of war. The latter were dealt with in Off t 

Units, where tests were dispensed with and everything was 
planned to reassure the men and to readjust them to normal 
life. Similar principles were extended to the handling of 
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other rank ex-prisoners in Civil Resettlement Units, and to 
all demobilised soldiers who felt the need of help in their 
readjustment. Accounts of this extensive experiment in 
psychotherapy and its successful results have been given by 
Wilson (1946) and Curle and Trist (1947). 

Selection of war-time temporary officers for permanent 
commissions in the regular Army, and the development of 
a technique for selecting temporary officers in the peace-time 
conscript Army. 

(6) Assistance was given also in the selection of Royal Marine 

Corps officers, paratroop officers, and in an experiment in 
the organisation and methods division of the Treasury. Е 

Since the war many of the R.T.C. staff have joined together in 
the Tavistock Institute of Human Relations, and are applying the 
Principles and techniques evolved in the Boards to the study of 
industrial, educational and other civilian problems (cf. Jaques, 
et al., 1947). 

Civil Service Selection Boards (C.S.S.B.s) have also been set 
up to advise the Civil Service Commission on the choice of can- 
didates for the administrative class and foreign service. But these 
differ markedly from W.O.S.B.s. Their chairman and observers 
have somewhat analogous functions to those of the president and 
M.T.O.s, but psychiatrists are not employed and the psychological 
staff consist of ex-industrial psychologists. Group discussions, 
planning problems and the like are applied, though with the object | 
of throwing light more on the quality or calibre of the candidates 
intellectual-powers than on their social adjustments. 


(5 


— 


Discussion 
'The reader may conclude that W.O.S.B. methods were essen- 
tially similar to those of German military psychologists, which we 
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of the leaderless groups or other M.T.O. techniques were stan- 
dardised tests; no measurements were taken, and only to a limited 
extent were observations recorded under any standard scheme. 
Although these techniques aimed to get away from what Gestalt 
psychologists such as Brown (1936) call “Class Theory”, i.e. the 
analysis of personality into a number of self-sustaining traits, it is 
likely that most of the psychologically untrained observers con- 
tinued to treat them as tests of leadership, co-operativeness, and the 
like. Again, field theory is as yet largely a priori and empirically 
unverified even though it is not—like German theories of character 
g unverifiable. There is no evidence that the consistency of 

reactions to group tensions" is any greater than the consistency 
of conventional personality traits. We know, for example, that a 
candidate who displays “leadership” in an artificial test situation 
will not always show the same trait in real life, but it is doubtful 
whether his performance in leaderless group tests will be any more 
Prognostic. Indeed, such tests are open to serious criticism from 
field theory itself. Every group of candidates constitutes a unique 
combination or Gestalt, hence any one candidate’s behaviour is 
likely to vary considerably with the make-up of the group. Though 
faced with objectively the same task, his reactions would probably 
differ if he was tested along with a different set of candidates. This 


objection was, however, partially met by the common practice of 
M.T.O. tests. As there 


re-grouping candidates in later series of M. 
were always several sets running simultaneously at а W.O.S.B., 
the best one or two from each set might be put together, or all the 
borderline candidates, and so on. Я 
_Another fundamental difficulty is that the intense desire of can- 
didates to pass leads them to "put up a show” which may not be 
typical of their everyday behaviour. All sorts of distorted accounts 
of what the boards were looking for used to circulate among 
potential candidates, and they would naturally tend to mould 
themselves accordingly during the two or thrée days examination. 


It is known, for example, that many arrived. with ready-made 
i based on accounts 


Stories to fit the thematic apperception pictures, 
gleaned from previous candidates. ; 

In justification for some of these weaknesses, 1t should be 
Pointed out that the W.O.S.B. system was not developed, and 
Should not be regarded, purely as a diagnostic technique. Under 
the circumstances of the war, the fact that it won wide acceptance 


P.S.— 3 
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from the Army and stimulated the recruitment of candidates—in 
other words its psychotherapeutic effect—was at least as important. 
It was extremely valuable in that the Army was led to believe that 
it was thereby getting the best possible officers, even though its 
methodological shortcomings were such that it probably made 
numerous incorrect choices. But other selection techniques too are 
far from perfect, and, like other techniques, it was certainly an 
improvement on older methods. 


Since the war there has been a growing tendency for the selec- 


tion of managerial, professional and other high-grade workers to 
be modelled, to some extent, on the W.O.S.B. plan (cf. Fraser, 
1947; Hoovers, 1948). One to three day meetings are held instead 
of half-hour interviews; psychological tests, group discussions and 
exercises are introduced, and a qualified psychologist usually 
assists the employers’ selection committee. This has several 
advantages: А 

(1) It provides amuch larger sample of the candidates’ behaviour 

on which to base judgments. 

(2) It enables the selectors to observe the social interplay of 

candidates collaborating or competing with one another. 

(3) It appears manifestly fair to candidates and to employers, 

and so stimulates confidence in the system, whether or not 
this system is entirely sound scientifically. 

But a good deal of caution is needed in adopting the W.O.S.B. 
methods since, as shown later, their value has not been sufficiently 
thoroughly demonstrated. It should not be supposed that the 
introduction of a few situational tests, apparently analogous to 
situations involved in the job, provides the key to accurate assess- 
ment of personality and job suitability, nor that the fallibility of 


human judgment and the need for scientific validation are lessened 
thereby. 


7 


| 


CHAPTER У 
SELECTION IN THE ROYAL AIR FORCE 


Abstract.—Although aviation psychology had been started in most 
Air Forces during the 1914-18 war, research did not continue after 
the Armistice, with the result that in 1939 the R.A.F. was almost 
entirely dependent on the unstandardised interview in its chief 
selection situations. 

\ In 1940 a few psychometric tests were introduced under the 
auspices of the Air Ministry Medical Branch to aid the Aviation 
Candidates Selection Boards. This testing procedure was under 
the technical supervision of the Cambridge University Psychology 
Department where-the first groups of W.A.A.F. testers (clerks 
personnel selection) received their training. 

In 1941 Ground Trade Selection tests (including G.V.K.) were 
devised and introduced by Stephenson (Oxford University) who for 
two years acted as personnel adviser to thé Central Trade Test Board. 

Finally, in 1943, the technical responsibility for all selection test 
Procedures became centralised at Air Ministry, where Bott 
(Toronto University) had founded a training research branch late 
in 1941. The terms of reference of this branch (now known as 
Science 4) made it responsible for advice on all psychological 
Matters arising in the Air Force outside the realms of physiology 
and psychiatry. 

The early energies of Training Research were very largely devoted 
to the reduction of wastage in air crew (especially pilot) training. 
In 1942 grading (standard flight testing) was introduced; it resulted 
almost immediately in a 50 per cent. drop in pilot training wastage. 
Two years later a large battery of tests was assembled with the 
Purpose of obtaining aptitude indices for each category (pilot, 
Navigator, etc.) in respect of all accepted air crew candidates. 
Reduction in training programmes since 1944 has made valida 
difficult, but large samples have been. followed up wherever 
Possible. 

In recent years research has been made into the personality (as 


tion 


+ distinct from the skill) requirements of air crew. Technical effort 
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has also been expended in the selection of the officer and appren- 
tice populations, while at the moment an amplified ground trade 
test battery is in preparation. 

In addition a good deal of work has gone into the problem of 
training assessment (both ground and air), while two other highly 
important fields (instructional technique and morale survey) claim 
most of the residual energies of an all too small branch. 


The seeds of aviation psychology were sown in the first World 
War, Italy (Gemelli) and the United States (Henmon and Thorn- 
dike) making the most substantial contributions. But, unfortun- 
ately, psychologists were nowhere given a chance to follow up their 
openings in the years after 1918 with the result that the second 
conflict found the majority of Air Forces unprepared for sudden 
large-scale expansion. Thus, in 1939, the R.A.F. was relying 
almost everywhere on traditional methods to select and place its 
men. The claims of air crew applicants were vetted by a series of 
Service boards who so far as interviewing was concerned were 
unselected, untrained and supplied with no technical aids. Simi- 
larly the selection of officers was entrusted to boards with little to 
guide them beyond the knowledge that all the candidates they saw 
came with some sort of recommendation from a commanding 
officer. Ground trade applicants who claimed relevant civilian 
experience were tested in respect of these claims by expert trades- 
men; but these same tradesmen were required, with no special 
training, to undertake the much more general problem of assessing 
the potentials of the great majority of recruits who made no claim 
to any special skill. Such very briefly was the picture of selection in 
the first autumn of the war, and it has to be admitted that some of 
the features indicated were slow to change appreciably even after 
the notion of selection as a science had taken fairly firm root. 

The introduction of systematic selection in the Air Force was 
more gradual and less unified than in either the Army or Navy. 
"Three distinct phases are to be noted: 

(i) 1940-2. Air crew selection under the technical guidance.of 
F. C. Bartlett (Cambridge University), introduced and 

.. Sponsored by the Medical Branch of the Air Ministry. 

(ii) Ground trade selection initiated and developed by W. 
Stephenson (Oxford University) acting as personnel adviser 
to the Central Trade Test Board, West Drayton. 
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(iii) December, 1941. Arrival at the Air Ministry of E. A. Bott 
(Toronto University) to found Training Methods (later 
Training Research and now Science 4). From this point all 
work on selection and training research gradually became 
centralised under a single authority. 

The early work can most easily be discussed by considering 

each of these phases in turn. 

The “Bartlett Tests.” —Early in 1940 the Cambridge Psycho- 
logical Laboratory was invited to produce a general intelligence 
test to be given at the Aviation Candidate Selection Boards. The 
time allowance (15 minutes) was hardly liberal, but the introduc- 
tion of any standard test, however brief, must be accounted a step 
forward. Three parallel forms of a 20-item verbal intelligence test 
(С.І.Т.) were prepared and made available to all A.C.S.B.s in the 
middle of 1940, The tests were at first administered by orderlies, 
but it quickly became obvious that specially trained staff were 
needed, and a trade (clerks personnel selection) was created to 
meet this requirement. The results of the tests, expressed in 
5-letter grades based on a 1-2-4-2-1 division of the scores of 
applicant populations, were available for the guidance of board 
Presidents, but no cut-off levels were laid down, the presidents 
being free to use their discretion in interpretation. General advice 
Was, however, given on the scores below which admission to (a) 
pilot and navigator categories, (b) wireless operator and air gunner 
Categories was deemed problematical. 

At the same time as the G.I.T. tests, a 15-minute elementary 
mathematical test was introduced and used as a very approximate 
guide to educational attainment. Both the G.LT. and E.M.T. had 
the merit of being objectively scored, the scores being evaluated 
against standard grades. A third written test (general knowledge) 
consisted of two fifty-word essays the purpose being to afford some 
clue to powers of expression and alertness to current affairs. Only 
à very slight reliance on this test was, however, encouraged, the 
risk of subjectivity in marking being considerable. R 

The three tests just described remained part of the standard air 
crew selection procedure for more than three years. They may not 
have made a large contribution to the training wastage problem, 
but they undoubtedly helped to make the Service test-conscious, 
and in doing this paved the way for the much fuller testing pro- 
grammes of later years. In the beginning of 1942 the battery was 
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augmented by the S.M.A.3 (pilot co-ordination) test (cf. p. 248), 
and a sub-battery of three tests—directions, tapping and morse 
aptitude—aimed at the elimination of unsuitable wireless operator 
trainees. S.M.A.3 was the work of G. O. Williams (Central Medical 
Establishment, R.A.F.), while the wireless battery was assembled 
by G. C. Drew (Cambridge University), who later came to do full 
time work at the Air Ministry. The latter tests were introduced 
executively following an experiment at Blackpool which led to a 
sharp reduction in wastage in the basic signals course. 

The “G.V.K.” Tests.—In the early part of 1941 Stephenson 
came from Oxford to take up full-time duties as personnel adviser 
(ground) to the Central Trade Test Board, with whom he remained 
for more than two years. Very little experience of trade allocation 
problems convinced him of the need for a test whose first function 
would be to sort the large and diverse weekly intakes into some half 
dozen broad ability zones. For this purpose categorisation on the 
most general lines was required, the question of group and specific 
abilities not being of paramount interest at this stage. Because, 
however, sharply contrasted trades, e.g. clerical and mechanical, 
were liable to make demands for personnel at each ability level 
there was a clear advantage in designing a two-purpose intelligence 
test which would give pointers in respect both of general ability 
and of strong leanings towards practical or theoretical work. The 

| three-part test known as G.V.K. is well designed to do both these 
things. A reliable estimate of a recruit's general level can be 
obtained by averaging the sum of his three scores. But as the HAUS 
section has a considerable language (as well as verbal intelligence) 
content it serves as a rough guide to educational attainment and 
suitability for sedentary occupation, while “K” directs attention 
to the possession or non-possession of practical-mindedness. А 
simple arithmetic test was added to this short battery which in all 
occupied rather less than an hour of testing time. 

To understand how the above tests were used it should be 
explained that during the war years the ground trade problem was . 
one of trade recommendation, not of overall acceptance or rejec- 
tion. In this it differed sharply from the air crew situation where 
initial rejection rates rarely sank below 50 per cent. But the R.A.F. 
ground population was in effect no different from the population 
applying for entry and the divergences of both from the civilian 
population were probably small (cf. p. 198). 
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Recommendations to ground trades were made by interviewers 
chosen primarily for their experience in certain technical trades. 
These were the people whose task the test battery was intended to 
assist. For such aid to be effective, guidance was given on the mean 
test scores found among trained personnel in each R.A.F. trade, 
and on a “failure” score for each trade, i.e a score below which 
success in training had been demonstrated to be small. (Samples 
of this are given in a later Chapter.) Scores were presented in 
percentile form, the basic population being a random sample of 
5,000 drawn from early 1942 intakes. The norms established at 
this date have been held constant ever since, but revision in terms 
of current N.S.A. and volunteer intakes is intended. 

Specialised tests were devised only in respect of one or two 
Priority trades, e.g. radio wireless mechanics. These were given 
at the interviewer’s request to recruits found potentially suitable 
for such trades. It will doubtless be thought that a battery of four 
short general tests and two or three special ones is a very slight 
instrument with which to attempt allocation to more than a hun- 
dred occupations. Unquestionably an ampler battery would 
be desirable and it is hoped to introduce one in the com- 
paratively near future. Meantime one can note the ingenuity 
and economy with which the G.V.K. test performs its double * 
function. Р 

Training Research.—Both air crew. and ground trade selection 
were thus already under way by December, 1941, when Bott, with 
C. R. Myers as his assistant, took up his Air Ministry post as adviser 
to the Air Member for Training. For twenty months an embryonic 
branch was concerned almost exclusively with air crew problems, 
particularly with that of drastically reducing pilot training wastage. 
Its terms of reference were, however, exceedingly wide; so wide, 
In fact, as to cover almost any application of psychology anywhere 
in the Air Force. It was thus possible when Stephenson relin- 
quished his post in July, 1943, for ground trade selection to be 
brought under the same head as the selection of air crew, while the 
fact that the advisers had the right of entry to bóth selection and 
training problems made possible a longitudinal survey of the air- 
man's whole career. That the study of selection and training 
has proceeded unevenly, the bulk of enquiry in the early years 
having been directed to the former, must not be ascribed to any 
maldistribution of effort. It has resulted simply from the fact 
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that it is comparatively wasteful to undertake detailed research 
into training methods until something has, been done to ensure 
that approximately the right people are entering training. 

Between 1942 and 1945 Training Research (the name was altered 
from Training Methods early in 1943) expanded from a section 
with three or four research workers to a branch with a provisional 
scientific establishment of seventeen. During most of this period 
technical staff were recruited by a series of special appointments, 
but in the autumn of 1944, it was decided to bring these temporary 
appointments within the framework of the Scientific Civil Service. 
During a large part of 1945 nearly all the established posts were 
filled, though lack of civilian psychologists led to a number being 
taken by serving officers with psychological qualifications. Since 
the end of the war, however, the strength of the branch has 
declined by over 50 per cent., the chief reasons being demobilisa- 
tion and delays in deciding the future status of psychologists 1n 
Government employment. At the moment of writing (December, ' 
1947) the small staff remaining is being forced to spread its energies 
among a greater number of problems than it can possibly do full 
justice to, while other problems, some of great urgency, cannot be 
tackled at all. 

From July, 1943, when A. J. Marshall took over Stephenson's 
work on ground trade selection, the branch was divided into two 
sections, the first dealing with all air crew matters (whether selection 
or training) and the second with ground trade problems. Later 
re-organisation brought all selection problems (air and ground) 
together, a second. section being set up to deal with training 
problems. 

The functions of the branch have throughout been advisory. In 
the early days responsibility was to the Air Member for Training 
alone; later, as the emphasis shifted back to the earliest stages of 
selection and as the desirability of a centralised selection system 
came to be appreciated throughout the Air Ministry, advice was 
increasingly sought by the Air Member for Personnel also*. 


* In 1947 a Deputy Directorate of Selection was formally established to con- 
trol the policy and administration of all selection procedures throughout the 
Service. This personnel branch had, in previous years, done much to raise the 
general standard of service interviewing, particularly in the fields of reselection 


of air crew training failures, reallocation of tour-expired air crew and the dis- 
posal of former prisoners of war. 
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Finally, as Science 4, the branch became part of the Scientific 
Adviser’s organisation and through him responsible to the Chief 
of Air Staff, 


Training of Testing Staff 

A short account of the selection and training of testing staff may 
be of interest. From 1942 to 1946 Clerks P.S. consisted exclusively 
of women, but the quickened release of W.A.A.F. brought about 
an almost complete change-over to R.A.F. personnel. It is antici- 
pated that the final constitution of the trade will provide for equal 
Proportions of men and women. As regards selection two main 
Points have been established. First, it has been found unprofitable 
to train anybody below the top quarter of the R.A.F./W.A.A.F. 
Population in general intelligence. Secondly, great pains have to 
be taken to ensure that every volunteer knows what the duties of 
ыч е trade involve. The second point may seem obvious, but very 
little experience showed that the notions of many early applicants 
as to the nature of psychological work were wild and sometimes 
crude, A number assumed, rather naturally, that personnel selec- 
tion duties entailed a great deal of individual interviewing, while 
others, less creditably, looked forward to a banquet of Freudian 
symbolism. \ 
- Pre-selected personnel received their training in courses origin- 
ally of three weeks duration, but the length of these courses has 
Brown to six weeks, as the scope of the work has developed. Not 
only has the number and variety of tests and testing situations in 
the R.A.F. increased steadily since 1942, but features other than 
test administration have been added. Thus while the prime aim 
of the courses has’ always been to bring testers to a high level of 
Proficiency in giving and invigilating tests under standard con- 
ditions, it has also been found profitable to give a grounding in 
Statistics, test construction and research planning. As a result of 
this the limited resources of the scientific staff have been saved 
from a mass of semi-technical effort which could never have been 
entrusted to untrained assistants. In addition the existence of a 
Corps of intelligent and enthusiastic testers has been of inestimable 
value in spreading an enlightened conception of psychological aims 
among the Service. ' 
A great deal of thought has gone to 
Content and syllabus structure. Instructiona. 


the question of course 
] periods are divided 
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almost equally into practical (i.e. exercises in the giving of patters, 
demonstrations by wire recorder, marking, practice, etc.) and 
theoretical (mainly but not entirely lectures). Lectures include 
three main subjects, principles of mental measurement, statistics 
(where each lecture is followed by an hour of practice) and R.A.F. 
selection situations. About three-quarters of all the instruction 
is given by the school staff (personnel selection officers), the 
remainder being covered by visiting lecturers from Science 4. 
During the last week a passing-out examination takes place, the 
main emphasis being on practical testing with objective tests on 
testing theory and statistics. 


Air Crew Selection 


The entire choice of air crew (medical issues apart) was entrusted 
at the beginning of the war to boards of serving officers who were 
afforded no more definite briefing than to find the right types. 
"Three assumptions can be detected behind this: (a) that there are 
right and wrong types for air crew duties, (b) that such types can 
be detected by simple interview, (c) that officers who have them- 
selves been air crew are particularly suited to do the detecting. 

The questionableness of these assumptions needs no elaboration; 
more important is it to appreciate the complexity of what the 
boards were asked to do. First, they were expected to decide not 
merely who should be accepted as air crew and who should поб! 
but also to which air crew category an accepted recruit should be 
sent for training. Secondly, the boards were given no guidance 
as to the relative importance of personality and skill factors in 
reaching these decisions, and thirdly, they were provided with n° 
technical aids with which to measure either, The introduction © 
the Bartlett tests marked a small but significant step towards the 
solution of the last issue. Whatever intelligence, educational 
attainment and sensori-motor co-ordination may be, they clearly 
fall outside what are ordinarily considered personal qualities. This 
distinction once made, the path was to some extent cleared for the 
separation of skill assessment from the appraisal of character. This 
separation was carried much further by the introduction of the 
pilot "grading" testin 1942, and was completed when the Air Crew 
Aptitude Test Battery was brought їп during the spring of 1944. 
From that point the job of the selection boards was to evaluate 
personal suitability for air crew, the allocation to categories being 
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undertaken at a later stage and being determined by aptitude and 
preference*. 

Towards the end of 1941 the decision to move flying training 
overseas focused attention on the selection of pilots. This was the 
first concrete problem Training Research were called on to attack 
and their initial analysis showed some alarming figures. It was 
found that of many hundreds who had commenced pilot training 
two years earlier only 41 per cent. had entered on operations, 
while 36 per cent. had failed to qualify. Moreover, two-thirds of 
these failures were falling short at the elementary flying stage which, 
in other words, was acting as а selection screen so far as flying 
ability was concerned. This was uneconomical enough when train- 
Ing was in this country, but it would clearly be insupportable with 
schools spread over half the globe. ; j 

The problem was far too urgent to allow of the evolution of a 
test battery even had such a battery seemed the most likely way 
to reduce wastage. It is, however, extremely questionable whether 
Such an approach could ever provide as good an answer as the work 


Sample method actually adopted. This method entailed a short 


Period (12 hours) of flying training with standard tests at pre- 
Scribed intervals (originally at 7 and 11 hours, later at 53, 8} and 
11} hours). 

It will be seen that such a selection system assumes a close 
correlation between speed of learning and the ability subsequently 
shown. This assumption was firmly substantiated before the flight 
tests were brought in, and the whole justification for their use is 
based upon it. The evidence is given in Chapter XVI, together 
With details of test content, weighting of items, scoring and valida- 
tion, and other technical points arising from the need to operate a 
Very complicated type of test in a number of flying schools simul- 
taneously (cf. also Parry, 1947). As the word grading implies, the 
Job of each of these schools was not to pass OF fail the individuals 
it tested, but to array them in order of merit so that the cream of 
each intake could be skimmed by a central authority according to 
the needs of the moment. To utilise the tests to the full it was 
Obviously necessary to grade considerably more than would be 


. * This statement slightly exaggerates the cleavage. If intelligence and pem 
tional attainment are regarded as aptitudes then aptitude did play some be d 
Part in the A.C.S.B. procedure. It is, however, true that temperament am 

character assessment played no part at all in category classification, nor у 
evidence to hand to suggest that they should have done. 


' that weight was 


76 PERSONNEL SELECTION IN THE BRITISH FORCES 


required to enter training, but, this margin conceded, on ei 
was as well suited to picking the best 20 per cent. as 
аа 1942 gross pilot training wastage had been hay ren 
mately 48 per cent.; with the introduction of grading t : vec 
dropped to nearly half (25 per cent.) The most important re d 
was very naturally at the elementary stage where the gross d 
i tages were 30 and 14, but a perceptible shrinl ag 

parative percentagi : j ‘ile operational 
(11 to 9) resulted ide ae stage Cavite uc w 

ining wastage fell from 7 per cent. to 2 . Nr 
шнен of ridin inevitably affected the баз рер 
of А.С.5.В.з which hitherto had decided the air crew жу 
each accepted applicant should enter. It was part and parcel Я ds 
new system that a high proportion of those sent to grading ks 5 
not become pilots, and consequently re-classification of t m 
some. other category had to be provided for. To meet this i 
boards early in 1942 were instructed to classify to the pilot, "hh 
gator and bomber categories jointly instead of severally. А 
meant that all deemed suitable for any one of these categories we | 
given the opportunity to go to grading, and as about 90 per wen 
of applicants were anxious to do this the field for the final selec did 
of pilot trainees was greatly increased. The scheme, however, 2d 
nothing to aid the selection of the other categories and the ne f 
for a further selection phase which would permitof the assessment ү 
navigator, gunner, etc. skills, became apparent eighteen months zw 

The primary purpose of aptitude testing was to secure s a 
measures for air crew accepted by the selection boards in d à 
of each of the six main categories. This led to the preparation © v 
two-day testing programme with twenty-four tests yielding pos 
sub-batteries. No experimental period was available either 
devise new tests or validate tests loaned by other services us 
posteriori validation was, of course, planned, and has been t 
out so far as continuity of training has allowed—a summary of t ү 
findings appears їп Chapter XVI, together with a detailed WU 
of the way the problem was handled. Here it need only be adde ч 
given in classification to the recruits’ priua 
wherever skill as measured by the tests was adequate and the nee 
of the Service permitted acceptance, 


BE 
* Tribute must be paid to the U.S.A.A.F. in particular, who loaned a co 
siderable number of their paper t 


ests. 
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From April, 1944 then, scientific measures have been in use for 
the assessment of air crew skills. But the problem of personality 
assessment remained, and though this has been attacked during 
the last two years much still remains to be done. A job analysis 
based on the opinions of 350 air crew officers has at least shown 
close agreement about the personal qualities held important in air 
crew, and incidentally suggests that the needs are much the same _ 
from category to category. It has also shown that the important 
qualities are for the most part conspicuously hard to rate at inter- 
view, while those that are easily assessable (diction, appearance and 
the like) are reckoned of very little account. АП this makes it a 
difficult matter to supersede the unstandardised service interview 
by a more reliable instrument. But the old procedure has been 
Strengthened by the insertion of a pre-interview at which the 
qualities deemed important receive independent rating. Table XLV 
(p. 277) shows that the reliability of trait assessments is on the 
Whole satisfactory. In addition a biographical inventory is being 
Prepared which is expected to throw more light on this question. 

Though the separation of skill and personality assessment has 
been a logical development, their temporal and geographical dis- 
placement has been an accident of circumstances. А fully deve- 
loped selection system works not by a series of successive hurdles, 
but provides for a weighing one against the other of at least the 
most important factors. "The R.A.F. have now taken the first step 
towards this by placing air crew sclection boards and aptitude 
testing at the same centre. 'The next technical problem is to find a 
reliable way of evaluating personality assessment 80 that this can be 
related to aptitude scores. ` 

To complete the picture of air crew se 
should be touched on: ( 

1. Advantage is now taken of the first stage of all—combined 

recruiting centre—to eliminate the more hopeless applicants. 

This is done by means of a fully controlled interview designed 

to anticipate selection board verdicts. It has been found poss- 

ible to anticipate in respect of the worst 5 per cent. to 10 per 

cent. with almost 100 per cent. accuracy. Technically this is 

= purely a screening interview, and is of interest because it has 

a precise and limited objective and has been proved to work 
effectively in the hands of interviewers with very little training. 
2. It has been claimed by the R.C.A.F. that the Link trainer 


LÀ 
lection two further features 
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can be adapted as a selection test to yield results De 
in accuracy to grading. This, if true, would result ina ee 
economy of money, time and effort. An experiment to x 
the claim was planned in April, 1945, but large scale cessatio 
of flying training has resulted in a mere trickle of rape 
evidence. The few cases to hand suggest that Link adds 
‘appreciably to aptitude testing; but it has not yet been con- 
firmed that it adds as much as grading. There is also a very 
real danger of candidates getting access to a Link i 
coming up for selection, and thereby securing a grea 
advantage. А 
It will Dore be well to summarise this rather complex EU 
with a table showing the successive air crew selection phases: 


; Selection 1 
Name of Phase Function ҮСЕШЕ Action 
1. Combined Re- Loose Personal- | Controlled Fliminadon of 
cruiting Centre ity Screen Interview lowest 10-25% 


2. Combined Selec- | А. — Full Per- 


Pre-interview | Results of A and 
tion Centre 


lity Assess- | Technique B weighed Бу. 
Set p a Final Board with 
power to inter- 
B. Differential | 2-Day Testing | view further 
skill Assess- Programme 
ments р 


(А апа В ргіог to mid-1947 constituted successive hurdles) 


3. Grading. Assessment of Standard Better pupils to 
(Potential skill in the air Flight Tests | Pilot training. 
Pilots only) 


after 53, 84 Remainder to other 
and 11} hours) categories found 
suitable in 2B 

Selection of Aircraft Apprentices, Adm 
and Boy Entrants ; 
situations in the R.A.F., outside air crew, 
rentices and boy entrants, and officers. In 
procedure instituted by Stephenson (р. 70) 
fundamental re-direction, though additions 
е been made to it, Further information as to 
flectiveness is supplied in Chapter XVI. 
€s and boy entrants were selected exclusively 
inment. Important though this factor is it 


; . © j ; ill 
een discontinued for administrative reasons, pilot ski 
tude tests alone. 


inistrative Apprentices 


The main selection 
are ground trades, app 
the ground trades the 
has so far received no 


its working and its e 
In 1939 apprentic 
on educational atta; 


* Grading has now b 
being assessed by apti 
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cannot T ғ я 
апа a ЕЕН оп to yield consistent indices of technical ability, 
an amplified ear ies were re-opened after the war the need for 
current program © 1^ procedure was generally recognised. The 
Басе carried эи or aircraft apprentices is in two phases. The 
ing educational e = a number of local centres, consists of a qualify- 
(under 10 ок кы аз а result of which a small proportion 
B seneral intelli nt.) of applicants are eliminated. At the same time 
then called to À nce test 1s taken. All who survive this stage are 
E Ul сл e R.A.F. selection centre where they undergo 
aptitude tests "emt take a one-and-a-half day programme of 
' acceptance ДЕЕ: ае given a two-stage interview. Their final , 
education, TUE 1 determined by three lines of evidence— 
and interpreted EO and technical aptitude, brought together 
used to give mel a trained interviewer. The tests are additionally 
mechanical po i sto trade allocation, particularly as between the 
ОЬ Alin d electrical trades, although every effort is made to 
choices oh rength of trade preference and to concede first 
a large batte Lae aes The tests in use have been selected from 
visionally v EM Ege s to the first post-war intake and pro- 
Will be spread ed against early training results. Full validation 
the four-yea overs number of intakes and will take all stages of 
There 1 т training into account. 
trative abend preliminary qualifying examination for 
to the A Са and boy entrants. The first group are called 
ment, general i md where they take tests in educational attain- 
ability at least ае апі clerical aptitude. A level of general 
А МА as high as that for aircraft apprentices is required. 
of the R.A DE populations constitute the technical nucleus 
Personnel x vin hi etween them provide a large proportion of the 
developing th S igh-grade trades. The information arrived at in 
relevant to OR puc test batteries is consequently directly 
test battery x ЕК ег task of building up а general ground trade 
entrants sth cach er light is expected from the selection of boy 
haero ге required for feeder trades. 
Apprentice auve арр and boy entr 
Choices m interviewed during the selecti 
every а ds carefully explored and effective со 
Ж Eum icant. — — ї 
age of selection is handled by the Direc’ 


consi 5 
loan of E debt is owed to the N.I.I.P. for their generous 


adminis- 


ants are, like aircraft 
on stage, their trade 
ntact made with 


torate of Educational Services. 
advice ап 
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Throughout the war the R.A.F. relied on the interview for its 
choice of officers, but in 1945 it was decided to adopt W.O.S.B. 
methods. A small preliminary experiment (December, 1943) in 
which R.A.F. candidates for commissions were put through both 
selection procedures had pointed towards the new technique as the 
better predictor of O.C.T.U. results. In June, 1946, an R.A.F. 


Selection Board with Air Commodore as president was established ' 


to choose the first post-war entry of cadets for the R.A.F. College, 
Cranwell. Board personnel had the advantage of observation and 
training at Army centres. 

During the following autumn the same board was called on to 
evaluate the claims of serving officers who had applied for per- 
manent commissions, and since then it has been in almost continual 
session, handling in turn both types of entry. Р 

It is early to look for validation evidence in respect of either 
the cadet entry or the serving officer population. Indeed, in view 
of the high rejection rate it is likely to be extremely hard to 
evaluate follow-up results, particularly in the case of the cadets 


where the great proportion of failures are lost to the Service 
completely. 


Other Aspects of Air Force Psychology 
The main types of problems, outside selection, with which 
Science 4 has been concerned are training assessments, instruc- 
tional method, and morale. The first of these has received a fair 
share of attention, the second has been studied only in respect 0 


isolated problems, while the third is claiming consideration now 
for the first time. 


Training Assessment.—The general problem is the introduction 


of objective methods in respect of practical as well as written tasks- 
The unreliability of traditional methods has been demonstrated 
over and over again and hardly needs emphasis here. Two illustra- 
tions will suffice: Variations in the use of a common scale have been 
shown to be so great from one school to another that the pooling 
of marks from different schools often leads to the most grotesque 
results (this, of course, is compatible with objective orders of merit 
within each school). Secondly, scarcely any agreement has been 
found, in many instances, between the marks obtained by indi- 


viduals at similarly named subjects taught in successive stages of 
training. 


— à 
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The objectification of written examinations presents no special 
difficulty in R.A.F. technical trades. The best procedure involves 
the sufnmoning of a small panel of instructors who under the guid- 
ance of a psychologist elaborate a pool of new-type items. From 
this it is usually possible to extract two or three alternative versions 
each with eighty to a hundred questions. 

The difficulties with practical tasks and above all aerial exercises, 
eg. pilot skill, are very much greater and a long experimental 
Period is customary before even moderately trustworthy tests can 
pe attained. The phase check or standard efficiency test (р. 267) 
which entails the application of the objective method to learned 
drill sequences has been adopted in many settings with satis- 
factory results. 

Two by-products of the whole training assessment problem are 
the supplying of reliable criteria against which to validate selection 
tests and the application of up-to-date corrections to syllabus con- 
tent. For the pooling of objective items forces attention оп the 
emphasis given to different areas and in some cases оп the survival 
of features which no longer need be taught. The danger of accept- 
Ing a neatly-made assessment scheme as evidence of sound assess- 
ing has been demonstrated again and again, as also the danger of 
assuming that instruction and assessment should necessarily be 
undertaken by the same people. s 

Instructional Methods.—Apart from a few isolated experiments 
(mostly in the field of morse training) very little has been under- 
taken here. This is not for any lack of problems. On the contrary 
Ше teaching syllabuses of most technical trades have become so 
Swollen and overloaded that there is a crying need for basic investi- 
gation. What is required ideally is the attachment of а qualified 
Psychologist to each of the main R.A.F. schools. Something much 
More fundamental than piecemeal patching and shuffling 15 called 
for. Controlled experiments into such things аѕ length of working 
day, relation of theory and practice, training sequence, classroom 
Conditions, use of demonstration aids, use of notes and selection 


of instructors all need to be investigated. ку. 
. Morale.—Peace-time morale, particularly with the conscription 
issue to reckon with, isa far more difficult matter than morale in us 
he first step is the systematic evaluation of opinions on su E 
topics as general sources of discontent, accident prevention an 
attitude of the conscript to the Service. An attempt is now being 
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made in these directions which make their appeal very naturally 
to the social psychologist rather than the psychometrist. While 
evaluation is in progress the Service psychologist can do much by 
making the Service mind aware that these problems exist and have 
to be met. It is usual and no doubt natural that these efforts should 
often be met with resistance. The view that people will always 
grouse, or that the conscript is bound to resent his period of 
service, is frequently heard, and needs to be combated. 


PART II 


The principles of personnel 
selection and guidance which 
have evolved both from pre- 
war investigations and from 
war-time experience. 


CHAPTER VI 


GENERAL PRINCIPLES OF VOCATIONAL 
CLASSIFICATION 


eo cud work in the Services consisted of a com- 
job puc selection—in the sense of picking the best men for a 
Sh. of guidance—in the sense of recommending the job 
o MEHR best suit each man. This conception, which we call 
and As classification, is worth pursuing 1n peace-time education 
Ё ustry. The following аге the main elements of classification 
Procedures: 

(1) Provision of information about jobs to candidates helps to 
promote self-guidance. The staff responsible for classifi- 
cation must also be provided with information in a usable 

€ form, but do not usually need detailed job analyses. ү 

) Collection of relevant data about adult candidates 15 mainly 
done by biographical questionnaire and interview. But cumu- 
lative pupil record cards are more appropriate with children. 

(3) Educational and vocational classification should take account 
not merely of the candidates' abilities, interests and person- 

alities, but also of sociological factors, in particular, attitudes 
(4) a education or to employment. Ros 
ests and measurements are important, but their réle in 
(5) ишан should not be exaggerated. 
nterviewing has several vital functions 
(6 Weaknesses, cannot be dispensed with. н 
) Classification should not be based on а single 
section” of the individual, but should be linked as а con- 
tinuous process with education or training. к 
(7) The main feature of the scientific approach is validation, ог 
continual checking of the value of the procedures employed. 
(8) Fully qualified psychologists can seldom undertake classi- 
fication themselves, but should give thorough training to 
suitable persons in education, industry, etc» and should 
supervise their work. 


and, in spite of its 


“cross- 
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The work of psychologists in the Forces which we have been 
describing can hardly be called occupational selection or vocational 
guidance. It is a blend of both, which we will call vocational 
classification. The term is not an ideal one since it suggests the 
forcing of recruits into jobs which they may not approve; whereas ` 
the essential object is to adjust or harmonise the whole range of 
recruits (each possessing his or her individual capacities and 
interests) with the range of Service employments (each having its 
particular requirements). 

Occupational selection has in the past often been a somewhat 
anarchical procedure whereby an industry attempted to grab the 
best available workers, and to disregard the wider interests of the 
community. It will be shown later that, provided the supply of men 
for a job greatly exceeds the demand, the quality of employees can 
be raised by almost any selection procedure, whether it be an 
interview by an untrained foreman or a few tests whose validity 
is very low. Naturally the better the procedure the greater the 
improvement in quality, but even an inefficient procedure will 
appreciably cut down the number of unsatisfactory employees 50 
long as no account is taken of the candidates wrongly rejected 
(cf. р. 127). If, however, we pay attention not only to the can- 
didates wrongly accepted, i.e. those who pass the selection proce 
dure and turn out unsatisfactorily, but also those rejected by the 
procedure who could actually have managed the job, the procedure 
must attain a much higher validity. Again, if the supply does not 
greatly exceed the demand, and the so-called selection ratio (the 
number to be employed divided by the number of candidates) 8 

` high, it is far more difficult to choose the best men for the Job 
(cf. Tiffin, 1946). 

In 1941, when psychologists entered the Forces, anarchic con- 
ceptions were rife. Certain branches of the Navy, Army, and Alr 
Force were using their own interview or other procedures to gra 
high-quality recruits, entirely disregarding both the needs of other 
less powerful or less attractive branches, and the interests of the 
‘thousands of men they rejected who would have been perfectly 
adequate for the work. Some branches, as we have seen, eve? 
enlisted the aid of university psychologists to improve their pro” 
cedures. The results were quite gratifying to the branch concerne 
and to the psychologists, but actually produced greater vocation 
maladjustment in the Forces as a whole. The same conception was 
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prevalent in education until the 1944 Act. “Public? and Grammar 
Schools could afford to set stiff entrance and scholarship examina- 
tions, whose validity in picking the pupils best fitted for secondary 
education was dubious, because of the low selection ratio, i.e. the 
excess of candidates over available places. It did not matter to 
them that thousands of pupils were rejected who would have 
derived as much (or greater) benefit from advanced education as 
the acceptances. 

Vocational guidance, on the other hand, has tended to regard 
the interests and capabilities of the individual as paramount, and 
has recommended the job or types of job best suited to him. While 
N.LLP. psychologists have always taken into account the oppor- 
tunities that each individual would have for following the 
recommended career, they were naturally unable to match their 
recommendations closely with the state of the labour market. Their 
system, in other words, works best when demand exceeds supply, 
so that the individual can readily pick and choose among а wide 
Tange of available jobs; whereas vocational selection prefers the 
Opposite state of affairs. Other difficulties with the guidance given 
by the National Institute to individual applicants are its length 
and expense. When each candidate consumes the best part of a . 
day's work by.a highly trained investigator, the fee cannot be 
reduced much below £4, which puts it out of reach of the majority 
of the population. Such guidance, moreover, is largely restricted 


to the higher occupational levels such as the professions, andamuch 


Simpler procedure might be adequate for less skilled work in which 
itself are less important. 


individual aptitude and interest in the work 

Very early then in the history of the Institute, attempts were 
made to extend the procedures more widely. In a series of experi- 
ments in London, Birmingham and elsewhere, guidance was given 
to large groups of school-leavers who were representative of the 
Beneral population. Outstanding features of this work were. 15 
Comparative speed and simplicity, its application to all occupa- 


tional grades, its validation by subsequent follow-up; and ү 
integration with a survey of the labour market in the particu. 


istricts where it was carried out. Thus, in the most iie 
;irmingham experiment, some 1,600 pupils were WM i 
assessed by 74 specially trained teachers, and were placed in Jobs 


which accorded as closely as possible with the assessments, by 10 


juvenile employment officers (cf. Hunt and Smith, 1945). Rodger 
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(1939a) suggests that large-scale vocational classification of school- 
leavers can best be tackled by such co-operation between the 
schools and the Employment Exchanges. Similar schemes have 
been attempted elsewhere, for example, by Macdonald (1938) in 
Edinburgh and Meiklejohn (1945) in Ayrshire. 

In the Forces, as we have seen, it was essential to, find suitable 
employment for all recruits and to consider the requirements of all 
the branches. The supply of good men usually fell far short of the 
demands, and wrong rejections by inefficient selection procedures 
had as serious effects as wrong acceptances. Vocational guidance, 
in the sense of recommending for each recruit the job he could do 
best, was equally inappropriate, since only a limited range © 
employments was available, each requiring certain definite quotas 
of men or women. Thus it was impossible to suit the interests, 
experience and aptitudes of every individual. To, take a nav al 
example: in 1942-3 most of the mechanic branches were crying 
out for men, and large numbers of recruits with no engineering 
experience had to be trained for such jobs. But by 1945 the demand 
dropped so greatly that many recruits with good mechanical ability 
and training had to become seamen, writers (clerks), and so forth. 
Even if Admiralty psychologists had possessed a perfect test which 
showed that, say, 20 per cent. of recruits had good mechanical 
ability, they would have had to disregard its findings when the 
Navy wanted either 30 per cent. or 10 per cent. of recruits for 
mechanic branches. While fluctuations may be less violent m 
civilian life, here too there is no guarantee that the distribution 0 
abilities and interests of school-leavers in any one area will mate 
the distribution of job requirements. And even in the largest 
industrial concern the range of jobs open to employees about to be 
allocated or transferred is obviously still more restricted. The 
present lack of candidates for mining, cotton, brick-making and 
other industries shows how impossible it is to obtain correspoD- 
dence. A good illustration was provided by the Dundee vocation? 
guidance investigation (Pallister, 1938), when it was found that 
roughly 16 per cent. of the girls leaving school would be absorbe 
by each of the main employments—factory work, jute mill, shop 
assistant, and office work. But when a questionnaire was given t9 
typical pupils, 30 per cent. wanted shop assistant, 27 per cent. 
factory, 20 Per cent. office, and 4 per cent. mill. Clearly a large 
proportion of girls were certain to be disappointed. 
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The с i i 
E eolian of classification worked out in the Forces has 
оо Иная applications during the post-war years when 
tion of the Pur riqniia i largely depends on the efficient utilisa- 
in the new appro: PT and woman-power. It is implicit, too, 
amount and LP er) to education. The desire to provide the 
talents and int шо of education that accords with each child's 
E the n erests is praiseworthy, even though, as Burt (19432) 
academic, Veces p classifying pupils at the age of 11-12 into 
sound. As ae and modern "types" is psychologically un- 
and colleges ha jones in С hapter I, most American high schools 
application ae ss пааша counsellors who are trained in the 
interviewin " ern psychological techniques of testing and 
educational « and who are likely, therefore, to be able to guide the 
than, say уссен of the pupils and students more effectively 
. ception of ae ES head master. In industry also the con- 
хова ssification is winning wider recognition. A firm such 
School-leayin BE Nock absorbs every year a certain proportion of the 
Selection fe populanion of the district, and uses its personnel 
available em = ures not so much to pick the cream as to sort the 
factory ro jo wm according to the various openings within the 
iffin (1946) ich they will be most suited (cf. Northcott, 1945). 
out that scien UR American industrial psychologist, similarly points 
abour is in = с vocational methods do not lose their value when 
short supply, since it is all the more important to use 


the 
m for tr. 
to improve s and promotions within any one firm, in order 
e quality of employees assigned to each job, and to 


Improy, 
Ju iue by reducing the number of misfits. 
n proce кибер principles underlying large-scale classifi- 
* applied in S as worked out in the Forces, which can also 
as он modern industry and education ? They may be listed 
1) Pfoviei 6 
) Е of information to candidates and t 
(2) Collecti the cis 
Tm ion of relevant data about the can 
(3) Due , cumulative records, etc. 
(4) Ap ca non of sociological factor 
( HAN of standardised tests of apt 
ОА кү 
ad rri selection or guidan 


Catio 


o those respon- 


didates by question- 


5. 
itudes and abilities. 


ce process with education 
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(7) Validation and follow-up of all procedures. 
(8) Training and supervision of personnel who carry out the 
scheme. 


Provision of Information 

Macrae (1932), Oakley (1937) and others have pointed out that 
one of the most potent sources of vocational maladjustment 1$ 
ignorance or distorted information as to the nature of various types 
of job. Experience in the Navy in particular shows that most 
people are capable of a large measure of self-guidance provided 
that they have access to simple but accurate descriptions of the 
work and all that it involves; and that a large amount of the 
psychologist’s time can be saved if they have received and con- 
sidered this information before seeing him. While direct vocational 
education is seldom if ever advocated nowadays, there is every- 
thing to be said for more education about vocations. In enlightened 
primary schools projects are often devised, which lead the pupils 
to acquire English and arithmetical skills, and these may be centre 
round, say, the post office, the railways, ships, house-building, ОГ 
а local industry. As the leaving age draws nearer, more compre- 
hensive surveys of employments can be provided by films, books 
and pamphlets, visits to works, and, as Rodger (1944) suggests: 
“Brains Trust” discussions. Talks by employers are, of course 
likely to over-emphasise the attractions of certain jobs, but talks 
by young or by mature employees, e.g. former pupils, may be 
enlightening. The value of giving vocational information tO 
beginners in any large industry, by films, talks, and works touts» 
is widely recognised, since it not only helps in the initial process © 
allocation, or in the re-allocation of employees who prove unsuit- 
able, but also stimulates each worker’s interest in his particular 
job when he realises how it fits into the whole scheme of production 

Still more Necessary is it for the person responsible for guidance 
or selection to be familiar with job requirements, and to be able 
to clear up misconceptions in the minds of candidates. Teachers 
who assess and advise school-leavers cannot be expected to be 
industrial experts, though any knowledge they can acquire 21 
use, ог pass on, is better than none. Т hey should for example 
study Oakley's Handbook (1937), the Ministry of Labour's pari 
phlets on careers, and other such literature. But, just as in l a 
Services, so in civilian life the state of the labour market varies, and 
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the nature of many jobs may alter radically. Hence it should be the 
Ministry of Labour’s function to survey the openings and to give 
adequate up-to-date information to the schools. Rodger (1939a) 
Suggests that this should be done under the same headings as the 
teacher or psychologist uses in assessing the qualifications of the 
candidates, for example: 

(1) Physical and medical requirements. 

(2) Educational qualifications. 

(3) Level of general intelligence. 

(4) Special aptitudes. 

Interests. 

6) Disposition or temperamental qualities. X 01 

(7) Other relevant circumstances (training, apprenticeship, ete.) 

It is not possible to enlarge here on scientific job analysis pro- 
cedures (cf. Bingham and Freyd, 1926; Viteles, 1932), but the 
main desiderata should be mentioned. Neither superficial obser- 
vation of the work bya psychologist, nor the description given by 
an employer or foreman are adequate. The experience of men Who 
ate successful at the work and of failures should be critically 
examined and integrated, and special note made of the commoner 
Causes of failure. The analysis should avoid the use of vague traits 
and abilities (persistence, concentration, dexterity and the A 
and should present as objective as possible a description of x 
actual duties or operations performed. At the same time 10 18 
Seldom useful to list in detail the specific movements, etc., or to 
"680ге to time and motion study analyses. Rodger’s list of points 
un be expanded to include information about employment jer 
tions (physical and social), special advantages and egi 
d and other incentives, promotion and prospects, Joes Sal 
А ure of training, relation to other jobs in the organis: , 
Ypes of expérience that may be useful or necessary. 


Collection of Data f 

i tion 

ae A. B. Wilson estimates that at least half of М eum 

p Which vocational decisions are based should ре der a udi 

E St history, the other half consisting of tests and nT и Ж 
S MC Present abilities and other guaia; Hence the enq 

€ past must be very carefully planned. r al 

account is SE Chapter VIII of the biographi 


А ) of the 
Questionnaires adopted in the Forces, and of the value 
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information which they yielded. On these are collected ant 
details of the recruits’ background, but their test scores and E. 
interviewer's findings and recommendations. Thus they Me 
the keystone of the whole classification procedure. Thay are neve d 
theless limited both by the accuracy of the recruits memory, pt 
by the time factor. They cannot contain more questions than E 
dullest recruit can understand and answer in half to three-quarter 
of an hour. М 

Vocational work in schools has the great advantage that it does. 
not have to be carried out in a few hours or days. Relevant features 
of children's careers can be observed and recorded at the иш 
instead of having to be recalled later. Thus, the cumulative recor 
card should be made the focus of classification procedures anne 
children. (In an industrial firm, an analogous record САР d 
obviously of the greatest assistance to the personnel official w s 
deals with transfers and promotions.) While most га 
authorities now maintain pupil record cards, an investigation 
the N.LI.P. (Cockett, 1945) has shown how unsatisfactory are ae 
of these for vocational purposes. Their emphasis tends to be оп 3 
weaknesses of pupils, whereas the careers master is mainly concer с a 
with potentialities and strengths. Usually they are so incomp a 
that he will need to design a supplementary form or else adopt а d 
fully constructed card such as that of the Institute, which covers: 

(1) Records of scholastic attainments over several years. 

(2) Medical data. 

(3) Positions held in school, games, etc. 

(4) Family background, out-of-school employment. 

(5) Interests and hobbies. 

(6) Ratings of traits of temperament and character. 

(7) Objective test results. 

(8) Occupational suggestions. orth 
Hamley's (1937) and Fleming's (1945) suggestions are also У 3 
consulting. Such a document is preferably filled in by the S m 
master or other specially trained member of the staff on the ie 
of discussions with the children’s teachers, rather than by 
various teachers themselves. 


‘Sociological Factors 


nts in 
So numerous and so complex have been the develop mi aac 
psychological and statistical techniques in recent years thai 
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is often a tendency to neglect the wider sociological setting of 
problems in applied psychology. As Burt (1947) has described, 
selection for secondary education is often discussed purely in terms 
of the children’s abilities as measured by examinations or teachers’ 
gradings, intelligence or other psychological tests. It is not enough 
to add assessments of personality traits such as industriousness, Or 
Of interests in different types of curricula. In many cases the main 
factor in success or failure at the secondary school is the attitude 
towards schooling current in the social group from which the chil- 
dren come, and particularly in their families. Similarly, estimates 
have been published of the proportion of the population likely to 
Possess the ability needed for university education, apparently 
ignoring completely the fact that going or not going to a university 
15 largely determined by the attitudes of the social and economic 
Classes to which prospective students belong. When some new 
technique, say intelligence tests or vocational guidance, works well 
In one context, over-enthusiastic advocates apply it indiscrimin- 
ately, not realising that it may require considerable modifications 
to adapt it to the mentalities of the individuals, or to fit it into the 
framework of the social institutions with which they are concerned. 

€ have mentioned in earlier chapters some of the conflicts that 
Occurred between the technicians and Service authorities. Un- 
doubtedly these would have been far more serious but for the con- 
Stant endeavour of the psychologists to adjust their procedures to 

avy and Army traditions, customs and standards. Similarly the 
educational or vocational psychologist should avoid planning any 
Scheme of selection or guidance in the abstract, and should make 
himself thoroughly acquainted with the educational or industrial 
System within which he is working. Burt deplores the absence of 


Sociology f ini :onal psychologists, and 
; rom the training of most educational р: у! А 
5 al workers in many 


Points out that the ision of psychiatric soci r ma 
child guidance clinics сы ль substitute, since their job 
is to note abnormal factors in the home situation rather than the 
normal mores of the society in which the children are reared. » 
To the Tavistock Clinic Group (cf. р. 64), it 18 not mere y 
Sociological but “‘sociatric” factors which are of prime poer 
Dat is the underlying—often unconscious —emotional frs m 
11 the organisation. It is well known, for example, that the 8 
ances consciously expressed by employees їп ап industrial nee say 
about the canteen or about wages, аге often projections Of OE 
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unwitting substitutes for, deeper dissatisfactions, with the o 
attitudes of the management. Similarly a request from a firm for 
the institution of scientific selection methods may be E defence 
reaction, an unconscious desire to ward off enquiries into more 
serious maladjustments. If the psychologist meets such a request 
by devising selection tests, or if welfare officers improve the Е? 
teen arrangements, these fragmentary solutions may actually 
exacerbate the trouble and lead to the appearance of fresh symp- 
toms. Should the psychologist suggest the real source of еш 
and inefficiency, he is apt to arouse hostility, or to be adea 
on to dealing with trivial details. Hence his aim, like that of the 
psychotherapist who is treating a neurotic patient, should be to 
help the institution to understand its own deeper motives and to 
achieve a more harmonious integration. 

While the present writers are sympathetic with such реа 
analytic explanations of the maladjustments both of individuals 
and of social groups or institutions, they would stress the suh 

 jectivity of these approaches. Just as different analysts diagnose the 
Same patient differently and explain his symptoms (and often 
ameliorate them) in terms of diverse theories of human motivation, 
so different “sociatrists” would be liable to analyse a firm's огап 
educational system's difficulties differently. Obviously, too, very 
few persons could be trusted to handle such investigations. On 
Shrinks from the prospect of a newly-fledged educational psycho Б 
logist lecturing his education committee on their unconsciou 
mechanisms which stand in the way of an efficient selection seheng 
Nevertheless, some training in abnormal as well as norma 
sociology should help him, or any other applied psychologist, ч 
realise why scientific selection and guidance schemes are not alway d 
immediately acceptable or practicable, to feel his way tactfully a? 
gradually and to bear with his frustrations. 


Tests it 

The principles of testing are discussed in later chapters. Here x 

is desirable to combat the exaggerated rôle assigned to sean 

of abilities by many writers, particularly in America. угы 
classification is often described as putting "square pegs E } 
“square holes,” аз though it consisted merely in matching mea ; 

cally the measured Characteristics of the candidates with " 

appropriate “profile” of job requirements. It is a misleading ove" 
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simplification of the aims and methods of vocational psychologists 
to neglect all the principles in our list except Nos. 4 and 7—Tests 
and Validation (only admitting Nos. 2 and 5—Personal History 
Items and Interview Judgments—in so far as they can be treated 
as additional *tests"). Indeed, nothing is more likely to bring 
Vocational psychology into disrepute, especially if the tests are 
applied and interpreted by not very highly qualified personnel. It 

‚ 15 gratifying to observe that the plans of American psychologists 
for the rehabilitation and guidance of demobilised recruits assign 
an important but far from pre-eminent function to tests, and 
recognise the crucial value of skilled interviewing, of surveying 
cach individual's past record, and of integrating the guidance рго-, 
cedure with training (cf. Scott & Lindley, 1946). In other words 
they appear to be accepting the view, generally held by British 
Psychologists (cf. Rodger 1939b) that tests should be “servants 
not masters," 

In most jobs such factors as health and physique, interests, 
attitudes and personality qualities, previous experience and train- 
d and sociological circumstances are more influential than the 
SURE which can readily be tested. Such factors can, it 1s true, 
t nes be treated quantitatively or measured by SEDE 
eli 5 (cf. рр. 135-142), but in actual practice they are тоге often 

‘cited by biographical questionnaire and interview, and weighed 
up qualitatively. 4 

Long and Lawshe (1947) point out that in 1919 psychologie 
ae expecting tests to be adopted widely in selection for айу m 

iq job, but that twenty years later there had been very ue е 
Vance. Only about 7 per cent. of a large sample of firms in the 
nited States were making any use of tests in 1936 (though there 

appeared to be a rise in some smaller surveys in 1940). They attri- 

E this partly to undue enthusiasm on the part of testers, partly 

(a undue suspicion among users. In our view the main factors xi 

(a) The extraordinary difficulties of devising practicable tests an 

su Y ing out even preliminary validation, still more of keeping 
Ch validation up to date and accommodating the test ыч 
? alterations in the nature of the work; and (b) the importance 0 


€ factors other than abilities, just mentioned. Few if any indus- 


tri; О 
Hal or educational organisations could afford the numbers 
o carry out voca- 


ghly skilled 1 ded t 
i personnel which would be neede А 
pns Schemes based wholly on tests. In the 1920's Hull (1928) 


i 
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advocated the construction of some 30-40 tests which between 

‘them would measure the abilities needed in 50 of the main occu- 
pations. By mechanical methods an individual’s scores on such a 
battery could be appropriately weighted and combined to predict 
his suitability for all these occupations (cf. p. 104). Only recently 
has this project been put into effect by the U.S. Department of 
Labour, Occupational Analysis Division (Dvorak, 1947). Fifteen 
varied tests, taking about two and a quarter hours, provide mea- 
surements of ten main ability factors, and minimum scores оп 
these factors are indicated for twenty groups of common occupa- 
tions. While the scheme is highly ingenious, we feel that it appears 
to make vocational classification delusively easy. Moreover, the 
validity of several of the tests, and of the factors based on them, in 
predicting occupational success is likely to be low, and no evidence 
on this point has so far reached us. 


The Interview 

This topic, too, receives fuller treatment below, and we will 

merely outline its main functions. 

(i) Clarifying and amplifying the data given in the question- 
naire and assessing its significance; in particular scrutinising 
claims to trade experience. : 

(ii) Providing the interviewee with more detailed information 
about any jobs in which he is interested or which appear 
suitable. 

(iii) Surveying his interests and attitude, e.g. towards different 
types of secondary education, to an industrial firm, or {0 
the Service. While it is entirely feasible to devise scales for 
measuring any one well-defined interest or attitude, ОГ 
several simultaneously (cf. Vernon, 1938a), yet so great 15 
their variety, and so complex their inter-relations !n 
different individuals, that the more flexible, even if less 
accurate, interview approach can scarcely be dispensed with. 

(iv) Judgments of temperament and character from facial 
expression, gestures, manner, conversation, and directed 
discussion of past history and future aims. This is the aspect 
of interviewing which is most unreliable, and most likely 
to differ from one interviewer to another. Nevertheless, it 
can be standardised and improved in many ways, and 
in the absence of good tests of personality, is almost 
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the only feasible approach to the assessment of this vital 
factor. 
(у) Stews and balancing up all the factors such as experi- 
ce, education, abilities, interests, personality and job 
реше Here also there is зо much scope for subjective 
judgment that the conclusions reached by poor interviewers 
may often be less valid than conclusions based, say, on tests 
alone or on the interviewee’s expressed preferences alone. 
Nevertheless, here also more scientific procedures raise 
(vi) рон insuperable technical difficulties. 
stimulating the individual’s morale. Even if (as may some- 
times be the case) the interview is worse than useless, it 
would still be desirable to include it, at least in Service 
classification procedures, because it is by far the most 
human” of psychological techniques. It shows the recruit 
кы һе аз a person is receiving consideration, and that the 
j ; рү is trying to take account of his capabilities and 
ЖҮР ests. It would be impossible too to convince the Service 
thorities that either the recruit or the Service itself was 


getting a fair deal if impersonal methods such as tests and 
d. Similarly, in industry the 


d by sympathetic inter- 
tactful persuasion 


e candidate is to 


viewing is a potent consi 
ften needed if th - 
j ited, rather 


by the interviewer is 0 
accept willingly the job for whi 
than the job which he thinks he would like 


Both Links with Training OF Education 

to to guidance and selection at the present time tend to be based 

О great an е“ + on? of th did: 

S poi extent on a single cross-section e candidates. 
d out under No. 2, classification procedures among 
n children can be spread over several years. While the ques- 


tionnai i 
?nnaires used in'the Forces attempt to take the past into account, 
proach advocated by R.A.F. 


the 2 
loe obvious advantages in the ap ad: 
and the gists, which regards selection and training as parts of one 
ina Se Same process: The effective placement of a civilian recruit 
ul 05 involves both his initial allocation and the success- 
dang eton of his training (or his re-allocation and re-training). 
is in B ITA his competence for the job 1s examined, just as it 
ction testing; moreover, each completed stage of training 
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is considered to be predictive of suitability for the next stage, just 
as is the psychological examination. ~ 

Vocational classification in the Services was handicapped by the 
weakness of the links between the psychological, the training and 
the education branches. Apart from the fact that they did not 
always see eye to eye, the effective integration of their aims and 

‘techniques was rendered impossible by the shortage of psycho- 
logists. The result was that innumerable’ problems of mutual 
interest failed to receive the scientific investigation they deserved. 
Some training courses were largely inappropriate; training devices 
and visual aids were employed without any proof of their value; 
old-fashioned examination methods were used which not only- 
probably failed the wrong men, but also provided thoroughly 
unsatisfactory criteria against which to gauge the validity of selec- 
tion procedures, Again, more attention might have been paid to 
the psychologists' view that, as only a limited supply of highly 
intelligent or well-qualified men was available, certain training 
courses should be simplified and lengthened for the benefit of the 
poorer-quality trainees. The naval “transfer” scheme. and the 
Army A.S.C.s, deScribed above, did help as it were to plug some 
of the leaks, but the most effective schemes of selection-cum- 
training in the Services were those developed for naval officer 
candidates, and for R.A.F. air crew. 

It is appropriate here to point out the close inter-connection 
between selection, training, and factors of design and layout in the 
job itself. When, as so often happens, the work involves needlessly 
complex perceptual and muscular activities, selection is rendered 
more difficult and training lengthened. Conversely, the psycho- 
logist can often simplify his selection problems by modifying the 
equipment in such a way that a much larger proportion of the 
а can manage the job after fairly brief training (cf. Craik, 

The desirability of the conception we are advocating is already 
generally recognised in education, Vocational and educational 
guidance should be a continuous process from 11 years to adult- 
hood. At each stage in a child's career the most suitable type 0 
future schooling, or the most suitable vocational plan should be 
considered. Moreover, the predictive value of the various school 
examinations should be followed up in just the same way as that 
of vocational tests, In many industries, again, the selection and 
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а . 7 
Е of apprentices, or their transfer to other simpler jobs if they 
о make good progress, are combined ina single department. 


Validation 


грани by which educational and vocational classification 
Бүкөш = are neither new nor mysterious. Psychological tests, 
Gerson et questionnaires are simply refined and standardised 
г ииий of the methods to which schools and industries have long 
EM po Thus, the main difference between psychological 
тада procedures is that the psychologist insists оп validating 
NM ments and testing his tests, instead of relying on ‘hunches, 
the 6 сене апа uncontrolled observation. He is aware of 
allibility of human judgment (including his own), of the 


ten ; А 2 
dency to jump to conclusions and to generalise from a few 


inst; ; Б : 
ances, to reason in accordance with our sentiments and com- 
th the facts, and to over- 


pan rather than in accordance wi у 
а 3 the personalities with which we have to deal. Investiga- 
БЕРБ, gradings or assessments of personality traits have shown 
КАҢ mM boe different judges of the same person may differ, 
m ay in uential is the “halo effect,” that is the impregnation of 
S cei gradings of specific qualities with his general good or bad 
ЕЕ ion (cf. Vernon, 1938a). The ordinary man or woman 
At sch ra for example, that because a boy cheats in his work 
this Do he is not necessarily destined for a life of crime, and that 
skill Praes probably has no bearing at all on his mechanical 
NT Е untrained interviewer ог personnel official is often influ- 
rn y even less relevant characteristics—a smiling face, a large 
n hands in the pockets, a girl's use of lipstick, and so on. 

orn, шоо; vocational suitability аге often based not merely 

jective impressions but on such tests as school examinations, 


9 es + 
ae actual performance in some job. This information too may be 
tremely misleading unless it is scientifically validated, and proved 
very common and insidious 


9 Measure the qualities needed. A 

ror, of which doctors and psychologists themselves 25 ee 

hon-scientists are guilty, is the “naming fallacy"—that is the 
ame as a 


as: : 2 vats 
sumption that a test is relevant if it has the same n 
for example, needs 


quality involved in the job. The radar operator 

pood “vision,” but no one has bothered to investigate whether the 

nus of vision tested by the medical officer are actually the same 
ose used in the work. Applied psychologists In Germany Were 


100 PERSONNEL SELECTION IN THE BRITISH FORCES 


always very prone to list the hypothetical abilities involved in.a job 
and to assume that, by devising one or two tests of each of these, 
they could predict job ability. British and American psychologists 
are generally more empirically-minded, and it is fair to claim that 
they know more about the predictive value and the limitations of 
their tests and other methods than do any of the hosts of organisa- 
tions which conduct educational, professional, trade or other 
examinations. 

Validation and follow-up investigations are carried out with: 

(i) Vocational procedures as a whole. 

(ii) Each part of a procedure, such as the separate tests, inter- 

views or questionnaires. 

(iii) The separate items that compose a test or questionnaire, the 
aim being to improve the instruments by choosing the best 
items or revising the ineffective ones. 

It is realised too that procedures or parts of them may have 
satisfactory validity in relation to one type of work, yet not to 
another superficially similar type. Hence they should be validated 
afresh for each job, and the process should continually be kept up 
to date with alterations in the jobs. 

Thus, any system of classification which does not provide for the 
validation and follow-up of its methods, and which is not con- 
tinually trying to improve these methods in the light of their 
results, should be regarded as little better than quackery. We 
entirely agree with Macrae (1932) and others who hold that 
successful guidance is, in many respects, an art as well as a science, 
and with Burt's (1942-3) admission that the intuitions of the quali- 
tative or clinical investigator have often led to more original 
advances in psychology than have the quantitative analyses of the 
statistician. Nevertheless, it is essential for the vocational psycho- 
logist to be trained in scientific techniques of investigation and 
statistical methods, so that even if he is not himself responsible for 
follow-up he can realise the importance of statistical checks and 
can interpret the significance of results obtained by others. 


Training of Personnel 
Since 1942 the supply of trained psychologists has been quite 
inadequate to cope with the demands of the universities, the ' 
Forces, education authorities and clinics, industrial and other 
employers, and these demands appear to be increasing. Itis essential 
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therefore to hand over much of the day-to-day work of classi- 
fication to less highly-qualified staff, as was done quite successfully 
in the Forces. There are, indeed, positive advantages in using 
officers who know more about the Navy or Army, teachers who 
know more about education, and personnel officials who know 
More about industry, than the psychologist himself is likely to 
know. Moreover, as already indicated, the psychologist prefers to 
educate organisations to carry out appropriate vocational schemes 
and to give technical help where needed, rather than to impose 4 
Scheme run by technicians. The system of using non-psychological 
staff worked well in the Navy since the recruiting assistants and 
5.0.5 were very carefully selected and trained in the first place, 
Were kept under constant supervision, and were fed with up-to- 
date information on naval requirements, job analyses and psycho- 
logical techniques. It was only slightly less satisfactory in the Army 
€cause of the very large numbers involved, and because the less 
effective powers of the qualified psychologists made it difficult to 
Maintain as high standards. In both Services, however, à check i 
Was kept on each member of the staff; for the work of a P.S.O. who 
interviews and makes employment recommendations is a technique 
which requires validating just as much as is an intelligence test. It 
Was often possible to eliminate the weaker ones or to transfer them 
to duties for which they were more suited. Problems both of over- 
Work and of boredom were often serious, but could be avoided 
under conditions which reduced the testing and interviewing loa 
and allowed the personnel to be trained for more responsible and 
More varied tasks, such as follow-up, documentation, job analyses, 
and research investigations. 7 DRE Ion 
Large-scale applications of vocational classification 1n SC Sod 
Such as the Birmingham and Ayrshire experiments, have relie 
mainly on the part-time work of partially-trained teachers, and 
Short courses given by the N.LI.P. have helped to provide moder- 
ately well qualified careers masters and mistresses for schools all 
Over the country. But there are very grave dangers in such shane 
being undertaken by insufficiently skilled personnel, dangers n" 
only to the pupils or employees who are misdirected, but 9 ог 
the reputation and progress of the whole of vocational psycho ову, 
present an employer or education authority has no easy mean 
Of distinguishing a competent person from a quack. А des 
in psychology or possession of Associateship of the Britis 
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Psychological Society does not necessarily show ability to under- 
take selection or guidance. Nor, on the other hand, are the many 
persons who acquired valuable experience in the Forces and 
carried out excellent classification work necessarily competent to 
organise civilian schemes on their own. Presumably what is needed 
is a central training institution conferring diplomas which would 
be as widely recognised as the London School of Economics 
Mental Health Certificate for psychiatric social workers (cf. 
Rodger, 1944). It would be desirable too for such diplomas only to 
cover a limited period of years, a refresher course and a re-examina- 
tion in knowledge of techniques and jobs, and in interviewing 
skills, being required before they were re-issued. Fortunately, 
„Psychologists are generally aware of the present unsatisfactory 
state of affairs, hence there is reason to hope that steps will be 
taken both to increase the supply of trained workers and to assess 
and maintain their standards of efficiency. 
A further problem which has not yet been studied at all fully, 
is the ethical one of the use to which information amassed by 
. psychologists or their assistants may be put. Often such informa- 
tion deserves to be treated with as great confidence as that received 
by a doctor or solicitor. It would be intolerable, for example, if 
children's test Scores, or material elicited" im clinical interviews; 
were passed on to employers. We did not, in the Forces, even 
allow the information we possessed about recruits to be handed to 
other Government departments without the recruits’ permission, 
since it had been collected not voluntarily but under conditions of 
Service discipline. At the same time the psychologist is not in the 
position of a doctor who can state: “X is not fit to do this type of 
Work, but I will not disclose what is the matter with him." The 
solution to such problems appears to lie, not so much in the 
development of still more reliable and valid techniques, as in the 
establishment of vocational psychology as a profession such that 
the judgment of a psychologist with appropriate qualifications will 
be accepted as readily as those of his medical or legal colleagues. 


'Obtai ey 
ined. Thus, in judging the value of vocat 
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ional classification 
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(MR P nical difficulties must be borne in mind. 

tional n has demonstrated the effectivenéss of voca- 
of student co cojas developed by the N.LLP.; but substantiation 
adequate. St unselling and child guidance procedures is so far less 
of Officers А e of the proportions of satisfactory trainees ог 
methods, d the Services, who have been selected by psychological 
several st 5 not suffice to prove the value of these methods. But 
е mor udies are cited where recruits 50 selected were found to 
е successful than those selected by older methods. Though 


the f 
9 H . 
Beer OU b апа assessment of the ultimate proficiency of, 
are particularly tricky, some fairly satisfactory results were 
edures аз а whole. 


Obtai 
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ual board members showed considerable variations in skill, 
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and much more investigation is needed (and is now being carried 
out) into the reliability and validity of the separate parts of board 
procedures. ' 


The importance of validation was pointed out in Chapter VI. 
Inevitably the subject is a somewhat complex and technical one, 
but we will attempt to describe its main principles and difficulties 
as simply as possible. "Thereafter we shall present some of the 
evidence for the value of vocational adjustment procedures as a 
whole. Certain essential concepts such as correlation and reliability 
have been discussed elsewhere (cf. Vernon, 1940), and are fully 
treated in most statistical textbooks. Thus brief definitions will 
suffice here. 

Correlation.—NWhen any two sets of scores or gradings are com- 
pared, for example results on a selection test and subsequent 
degree of proficiency, the closeness of agreement can be expressed 
as a correlation coefficient ranging from + 1-0, meaning perfect 
agreement, to 0:0 meaning no agreement at all. If those who do 
well at the test tend to do badly at the job, the coefficient will be 
negative, though this rarely occurs. More often coefficients are low 
positive (+ -2 to + -3), moderate (+ "4 to + -5), or fairly high 
(+ *6 to + -7), though far from perfect. A note on the implication, 
or concrete significance, of the size of correlation coefficients is 
appended at the end of this Chapter. 

Partial Correlation may be used when it is desired to find 
whether some additional test, examination mark or grading con- 
tributes anything to the prediction of some criterion, say job pro- 
ficiency, which is not already covered. Suppose, for example, 
pupils are selected for secondary education solely by an English 
examination, it is likely that the addition of an arithmetic examina- 
tion would cover rather more of the ground, provided that English 
and arithmetic marks do not themselves overlap very closely. If 
the pupils’ marks on the two subjects correlate perfectly, then 
obviously either (Will predict as efficiently as the two combined. 
The partial correlation of arithmetic shows its extra contribution 
when English is, as we say, “held constant.” 

Selection is generally based on several tests, etc., for example, 
English, arithmetic and an intelligence test, and their combined 
predictive value is shown by their multiple correlation coefficient. 
As some tests, considered singly, are likely to be more valid than 
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others, the better ones should, of course, receive more weight. By 
means of statistical analysis a multiple regression equation can be 
found showing the most appropriate weighting for each test, and 
by summing these weighted scores the highest possible multiple 
Correlation is obtained. 

Reliability refers to the stability of test scores, OF their trust- 
Worthiness, quite apart from whether the test measures any 
Particular ability or predicts any job proficiency. If testes’ E id 
alter considerably on taking the test a second time, or on taking 
a second exactly parallel test, the reliability (shown by. the correla- 
tion between their two sets of scores) will be low. Such an untrust- 
Worthy test is unlikely to possess good validity either. When both 
atest and a criterion have reliability coefficients of -6, the maximum 
Possible validity of the test will be -6 instead of 1-0. A good test of 
adequate length should have a reliability exceeding ‘9, and cer- 
tainly not less than -75*. More often the criteria against which 
Selection procedures are validated show poor reliability, as when 
the correlation between judgments of the efficiency of the same 
men by two instructors reaches only + ‘5. [ 

Statistical Significance —There are such Big differences betwee 
егеп individuals that results based on only, a few cases ^ 
Whether average test scores or correlations, etc.—are apt to vary 
also, or to be untrustworthy. Thus, in one group of say, SI ЕА 
there might be perfect agreement between an aptitude test a 
Proficiency, їп another group of the same size no agreement at all. 
ut the extent of such chance variations can be calculated and m 
trustworthiness of a result determined. A correlation coefficient 15 
Said to be statistically significant when we have proved that it would 
€ most unlikely to have arisen by chance. Similarly the аа 
etween the scores of two groups, Or between the corre ut 
obtained in two groups, can be shown to be significant and reliable, 
9r not so, ) 


Types of Criteria in Validation ) PA 
A great variety of criteria have been used in validating gulcan 


and selection procedures. The following classification 15 үе 
i б ‘ability i [3 
i "This account is greatly over-simplified, since reliability coefficients dep 
Чон umerous factors, such as the population tested, and the mer reliability may 
ES. аз well as on the test itself. Moreover, a test with quite 10 m 
pe Quite worth using if it makes a significant contribution to 
attery. But this is not the place for a full discussion 


Cronbach, 1947). è 
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articles'by Farmer (1931) and Stott (1939, 1943), Viteles’s book 
(1932), and on experience gained in the Forces. 


А. Objective Records of Individual Performance 

Selection tests have been compared with the quantity of output 
in industry, or with objective measures of the quality of the work, 
with words per minute transmitted or received by telegraphists or 
taken down or typed by stenographers, with hits on a target in rifle- 
shooting, and so forth. Indirect measures of proficiency include 
wages on a piece-rate job, accidents among operatives or motor 
drivers, breakages or spoiled work, and savings in the cost or time 
of training employees. The numbers of changes of posts, or the 
length of tenure of posts, have been used in assessing the value of 
vocational guidance. 

While such criteria are generally the most satisfactory because 
they'are the most clear-cut and the least affected by subjective 
judgments of employers or instructors, yet they often show rather 
poor reliability (hence records may need to be collected over a con- : 
‘siderable period), and are often distorted by factors not under the 
control of the worker, e.g. breakdowns in machinery or Trade 
Union restrictions. It is seldom, too, that comparable measure- 
ments can be obtained on a sufficiently large number of cases. 


B. Differences between Groups with Known Characteristics 

Men engaged on a given job, and presumably for the most part 
competent at it, may be contrasted with others not so engaged. 
Thus, a mechanical aptitude test should differentiate between 
mechanics and clerks. Advancement is a useful criterion, as when 
officers,instructors orpromoted теп are contrasted with beginners ОГ 
“other ranks.” “Drift to recommended job” is explained below. An 
example of an indirect measure is operating costs such as the saving 
in current on a tramway system after the institution of selection. 


C. Results of Theoretical or Practical Examinations 

School or technical examinations, trade tests, tests of elementary 
training among infantry, constitute examples. In lengthy training 
vourses with several stages, the stage that a trainee reaches without 
failing may be used. Various combinations of examination marks 
may be derived, e.g. by factor analysis (cf. p. 168). 

Inall of these the judgment of one or more examiners, instructors, 
etc., enters either in the marking or grading of the candidates’ work 
or—as in "new-type" tests—in the setting of the questions (сї. 
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Vernon, 1940). Hence variations of standards and the prejudices of 
different examiners inevitably affect the results. 


D. Gradings or Assessments 

This includes gradings based on observations of the man rather 
than of his performance at particular tasks. Merit ratings in 
industry or specially collected assessments of the efficiency of 
officers or men in the Forces are the main examples. Such judg- 
ments are extremely liable to be biased by the social qualities, or 
conformity to discipline, etc., of the people being graded. Many 
forms of rating scale and questionnaire have been developed in an 
attempt to reduce the effects of halo, variations in the judges’ 
Standards, and other defects (cf. Vernon, 1938a). Some of the 
methods adopted in the Forces are described below. £ 

Other criteria under this heading include the award of military 
decorations, the development of psychiatric breakdown, and self- 
ratings of satisfaction with their jobs by people given vocational 
guidance, 

Naturally there is much overlapping. For example, trade tests 
С) are often marked objectively and may, therefore, fall under (А). 
.h many examinations, too, the exáminerknowsthe examinees and is 
influenced in his marking of their work (C) by his general impres- 
Sion (D). Another classification which cuts across this is “training 
V. operational" All types of criteria may be applied during of at 
the end of training, or else to fully trained men working at the job. 


The Collection of Validatory Data 

Some books on applied psychology give the impression that 
Validation is a simple and straightforward matter of correlating 
Selection test results with a criterion of proficiency- Actually, it 15 
Very rare either in industry or education that a satisfactory, "ready- 
to-wear,” criterion is available. Under conditions of practical life, as 
contrasted with the controlled conditions which the psychologist 
Ппроѕеѕ when working in his laboratory, there are innumer- 
de disturbing influences which must be taken into consideration 


and, as far as possible, corrected. я 

Straker (1944) has given ап illuminating deseP “ш 
Collection of data in the Navy. Least information i$ usually avail- 
able about the most interesting cases—those who fail in their pum 
'ng or at the job. The psychologist should if possible interview suc! 


С 5 Пие, 
men and their instructors in order to pin down the causes of failure, 
\ 
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whether medical, disciplinary, or lack of ability or interest. Some 
training schools get rid of a number of trainees on arbitrary and 
inadequate grounds before the training course even starts, hence 
he should work on lists of men put forward for training rather than 
on the school’s own class lists. Next he may find that several men 
have been “‘back-classed,” i.e. transferred to a later class in order 
to increase the effective length of their training, and again the 
reasons may be varied—medical or compassionate as well as 
inefficiency. Training courses often have several sections which are 
separately marked, and the psychologist must decide which of these 
are essential for his purpose. The mere final pass or fail results are 
unlikely to be adequate for statistical analysis.* When a particular 
stage of training is conducted in more than one establishment, or 
by several different instructors, it is important to study the com- 
parability of the sets of results. Again the psychologist’s plans are 
liable to be upset by sudden alterations in the length of the training 
course, or in the syllabus, e.g. with the invention of some new 
piece of equipment, or by the transfer of the training to a neW 
school. Although the numbers under training at any one time may 
be small, he cannot usually wait to collect proficiency data on large 
groups. He must select and advise recruits as best he can, on the 
basis of past experience of similar jobs and on his own judgment, 
and improve his techniques as he goes along. All these difficulties 
have their parallels among school children (cf. McClelland, 1942), 
and among apprentices or trainees in industry. 

Success at training is obviously a less satisfactory criterion than 
success at the job, but nevertheless usually has to be employed. 
Training may last only a few weeks but often extends to one oF 
more years among skilled tradesmen and professional workers 
e.g. doctors. In such cases the psychologist should try to follow 
the trainees throughout, but is forced to guide his methods chiefly 
by results obtained in the early stages. Moreover, he has to select 
men who will pass training courses even if he is convinced that job 
requirements differ considerably from training requirements. 
Much evidence was collected, especially in the Army, as to the 
inappropriateness of some training courses. For example, many 


* The statistician can use any kind of results i highest 

d у ‚ such as rankings from hig! 
to lowest, or Excellent, Good, Average, Fair and Poor grades, or merely Pass ©, 
Fail, provided the numbers are large enough. But an extended distribution о! 


БЫ) the most convenient, and most economical of numbers (cf. Vernon; 
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recruits from civilian enginéering trades could not ‘be recom- 
mended for corresponding Army trades since they were too poorly 
educated to learn the required theory. Yet, in the field, these same 
men were transferred into tradesmen, without any training, and 
managed the work satisfactorily. 

When gradings (Type D) have to be used rather than marks 
(Type C), it was found best to formulate highly concrete questions 
Which forced the grader to assess the performance of each man in 
his training or at his job, instead of giving'an all-round opinion or 
judgments of general personal qualities. Useful questionnaires for 
following up officers were developed by W.O.S.B. psychologists, 
Containing some twenty to thirty items each with two to four 
Possible answers*. A similar but briefer questionnaire for other 
Tank investigations is reproduced here. Though the filling up of 
Such a form for a number of men appears formidable, graders find 
it easy to use, since it is couched in their own language, and feel 
more confident than they do in grading general traits. It is best 
done under supervision but can be sent out and returned by post 
when the men to be assessed are scattered in many different units. 


FortLow-UP QUESTIONNAIRE 


I.—GENERAL SoLDIERLY QUALITIES у ; 
However great the pu m battle, he never showed signs of cracking. 
- He stood up quite well when things were unhealthy. 
+ He needed nursing when the going was tough. 
He was not much use in battle. 4 
In action you could always depend on him to do 
being told. * 
When not under immediate command, he usually kept 
e always needed to be led and told what to do. 
. Hd CU tended to get bun дс aah 
е accepted bad conditions cheerfully enous®: 4: 
He helped to Se He men’s spirits when conditions were bad. 
- He is rather a disturbing influence. 
He КЕ оп all right Шр the others. p be is with 4 
Ц 15 always ve: opular in any grou) X д i е 
. He always E at s Bue and vigilant, even when direct contact with 0 
H спешу was до рее. 
? was reasonably vigilant. > ; 
е was inclined to ЕБЕ slack after a period © Sov or unexpected 
He is an intelligent soldier who can adapt himself to no 
H та а д 
- He is bright enough, and seldom at а 1055, snag. 
hisis ЖАКЕ КЧ ЛАН, gets stuck when he meets some unforeseen snag 
i ү Field 
h W.O.S.B. follow-up was influenced, like W.O.S-B. d ngu EISE 
eory. Hence it avoided asking for judgments of independen raphic rating 
co ever, the same conclusions as to the-advantage? of concrete, E S chologists 
in = had been reached by Behdvioristically-mind en 
е 1920s (cf. Vernon, 19383). 
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С. 1. His attitude to authority and to correction is apt to be “difficult.” . 
2. His attitude towards his seniors is that of an ordinary good soldier. 


3. He is exceptionally loyal and co-operative towards his officers and N.C.O.s. 


OVERALL ASSESSMENT i 
Outstanding Very High Satisfactory Passable Not up to Standard 
(Best 3) (6) (12) (6) (3 Least Satisfactory) 

Part II.—Proriciency AT His Јов 

ITQS OD Ie е ес oris acts CEET. lh Alternative ]оЬ............+++- 

Operational Experience : Extensive? Moderate? — Slight? 

Н. 1. He is outstandingly effective in handling tools of his trade (arms, tools, 
equipment, etc.). 

- He is competent and éffective enough. 

- He is rather ineffective. 

. His heart has never really been in his job. 

- He is interested in his job. 

- He is a real enthusiast about his job. 

- Physically he is well up to the job. 

- He is handicapped at his job by physical shortcomings. 


- He has a flair for improvising (with tools, materials, etc.) in an unexpected 
difficulty. , 


2. He is reasonably good at making the best use of what is to hand when 
things go wrong. 
. He is lost without the usual tools, materials, etc. 4 
. He has no eye for ground and cover and is apt to make clumsy mistakes. 
He makes reasonably intelligent use of country. 
- He has a real “poacher’s instinct” for using ground and cover. 


OVERALL ASSESSMENT d 
Outstanding ау Migh Satisfactory Passable Not up to Standar 


(Best 3) (6) (12) (6) (3 Least Satisfactory) 

A distinct method developed by Admiralty psychologists in 1942 
might be termed the clinical method, since the graders are not 
required to give any verbal, numerical, or other gradings. A con- 
ference is held with the officers or others who have had most to do 
with the men, and a general discussion is held to bring out the 
strong and weak points of each man as a whole. The discussion 1$ 
kept on as concrete a level as possible; such questions are asked 
as: “Which men would you choose to take with you on a small 
landing party, and why?” 'The psychologist thus tries to disen- 
tangle the efficiency or other relevant qualities of the men from the 
less relevant social-emotional qualities which so often bias the 
graders’ judgments. For example, cross-examination may reveal 
that a recruit would be considered excellent at his job, but that he 
is an ardent communist. From the discussion the psychologist 
decides which men are, Say, outstanding, above average, fair oF 
poor in respect of the qualities that concern him. He can also.judge 
whether some of the raters are too prejudiced or too little acquainted 
with the men for their cases to be worth including in the investiga- 
tion. He should, of course, be ignorant of the men’s test scores OF 
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other selecti 

DES Т е data, so that his gradings will be entirely impartial 
thorough нр den and the clinical approaches demand. E 
п. рни, ч ge of the job, so that the questions may be 

Often pure 5 essential features. 
raters E: arcta and rating scale methods are combined. 'The 
since this helps ea of men can be present simultaneously, 
are two or more amiliarise them with the procedure. But if there 
condition), the Gi for any group of men (a highly desirable 
EE odi кшш junior in rank should be asked for his opinion 
Clearly hoe rE be interviewed independently. 
experienced ee ection of good follow-up evidence requires an 
might resent tl tactful investigator. For example, some officers 
inefficient, but he imputation that any of their men are poor or 
men nee d is may be persuaded to discriminate if asked, “Which 
important piae W.O.S.B.- psychologists point out ап 
ments. ‘Train rence between “out-group” and “in-group” judg- 
by their p who will shortly pass on to otherunits are regarded 
domm: eed. ог temporary officers in a very different 
n belonging to an officer's own company ог crew. 


, Only a few К reatment of УДЕУ Criteria 
tion can be E the complex technical problems raised by valida- 
Vocational m here. Often the apparently poor showing of 
Priateness P f » ures 1s largely due to unreliability or inappro- 
unreliable, E e criteria. Accident rates, for example, are Very 
Correlation os if collected over as long a period as two years, the 
ally no hi Xe rae the rates in the first and second year is gener- 
Written sn than + 4, The untrustworthiness of ordinary 
Papere in tlie ations is well known. Two parallel one- or two-hour 
ment PEER one NEUE subject usually correlate 6 or -7, and the agree- 
higher. Practi SA examiners marking the same papers may be no 
can only Ae or oral tests are generally poorer still since they 
ledge, Meri SES small fraction of the candidates’ skill or know- 
commonly > Seat on personal traits or general efficiency are 
ever, by RAE to yield reliability coefficients around ‘5. How- 
Sufficiently пин several sets of observations, marks or ratings, а 
_ * When pie ren or stable criterion can usually be achieved". 
ne fore orate V tee half a dozen or more lor practical exam- 
c Bre determining the pen characteristics; factor anal у 
© сгїїегїоп сап De built up. Е qualities, аот the resul a more 
ysis УШ 5 ow, for examp 
institutions do—to add all the 


inati i 
9 theoretical OT, 5 
lysis is particularly 
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Even when criterion scores or grades are statistically reliable, 
they may or may not be representative of the proficiency in which 
the invéstigator is interested. Rifle-shooting scores, for instance, 
obviously do not coincide with all-round proficiency of infantry- 
men. In no practical job are marks on purely written examinations 
adequate, and they need to be supplemented with assessments if 
practical tests are not available. Thus the investigator generally 
requires to make at least a rough job analysis, and should consult 
instructors ОЁ other experts, in order to interpret the significance 
of the proffered criteria. 

Human beings vary so widely, and their success or failure at a 
job depends on so many factors, that no sound conclusions as to 
the usefulness of a vocational procedure can be derived from a few 
cases. One of the main reasons for the slow progress of vocational 
psychology-has been the small number of persons in any one 
civilian job available for investigation. In education and in the 
Forces psychologists have been more fortunate. No fixed rules can 
be laid down, but a study of fifty cases or less will usually suffice 
only to give some rough indications to the psychologist himself. А 

, minimum of two hundred is desirable for proving the value of some 
selection technique, and eight hundred for providing detailed 
conclusions, €.g. on the scoring of items in a questionnaire or the 
appropriate weighting of a battery of tests. In order to render a 
conclusion twice as secure, or to halve the influence of chance 
variations, it is always necessary to quadruple the cases. But much 
depends on the care with which the data are collected, and on 
the degree of refinement or coarseness of the measures employed. 

As far back as 1924, Thurstone pointed out that random fluctua- 
tions due to the “luck of the draw" are much less serious than varia- 
tions in the results of different groups brought about by conditions 


which the investigator cannot readily control. The agreement а 


between some procedure and proficiency may differ in different 
firms, schools, or Service units, because the groups are differently 
motivated, e.g. one tester obtains better co-operation than another, 
the training or education differs, the attitudes and general mor ale 
of the groups vary, the jobs at Which proficiency is assessed— 
though nominally the same— differ sufficiently to demand different 
patterns of abilities, etc. Time and again an investigator has 


marks together as measuring a general proficiency factor, or whether it is better 
to regroup them into two or more sets and to validate the selection procedures 
against each set separately. 
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obtai Si 
Ж сео p ы results їп a preliminary validation experi- 
a small PI i e carefully-controlled conditions, perhaps on 
very much eid Э гик only to find the value of the procedure 
m uus ds when he follows up a larger and more miscel- 
Beycholopist е. ndeed, a widely accepted rule among American 
bination of a 1s to insist on double validation. If the best com- 
of one grou Жүз їп battery of tests is worked out from the results 
up in es this combination should be tried out and followed 
statements a E nr before it is put into routine practice. Such 
following ie T s the following pass-marks been applied on the 
Successful,” S, 4. pet cent. of the men selected would have proved 
it Mie Papia wo LH 
experi Meis d recognised in educational ps 
is adequate iam out, even on large numbers, 
rather C en that variations in results between s 
explored b variations between individual pupils, should be 
1940), Were ne of analysis of variance techniques (cf. Lindquist, 
and the For ave hardly begun to apply this conception 1n industry 
ful to car ces. It would, however, undoubtedly be far more use- 
each, in Eras validational investigations on four groups of fifty 
extent of fhe RA training schools or units, and to analyse the 
investigation Р erences іп these four contexts, than to make one 
our proce URN two hundred cases. The latter will only tell us that 
under ide ures valid if applied to other exactly similar samples 
our r cipem conditions, whereas the former will tell us whether 
groups wi аге sufficiently consistent to be applicable to other 
ere conditions may vary as much as they did among the 


four actually studied. 


ychology that no 
in a single school 
n several schools, 


Allowance for Group Differences 


U; 
tion a the statistical techniques for the ty. 
Psychol mentioned are elaborate and not widely 
ogists. And as it is very rarely possible to colle 
for proficiency “under 


о 
2 Case; М 
s selected, trained, and graded 
ems have arisen 10 com- 


identi te 
‘“ntical conditions, many tricky probl т 
s. Take the common 10- 

etest marks 


ini 
QM UE results of numerous small groups. · 
Awarded Jd dap criterion consists of examination ОГ trad | 
Year, whe the end of training, or to pupils at the end of a school 
IN seve fue the trainees or pupils have received their instruction 
ral more or less parallel classes. Such marks depend on: 
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(1) The examinees’ ability, knowledge and skills. 
(2) Their interest, industry and other personality qualities, 
including “examination nerves.” i 

(3) Relevant experience or knowledge prior to training. 

(4) Length of training course. 

(5) Goodness of instruction or teaching. 

(6) The examiners’ standards of marking and personal idiosyn- 

crasies. 

(7) Miscellaneous environmental, health and other conditions. 

(8) Chance factors—the luck of the questions set. 
While this analysis applies primarily to criteria of Type C, similar 
Sources of variation occur with other types. Thus, in Type D it is 
seldom possible to get large groups assessed by the same judge, 
hence the various judges' standards of grading and their varying 
interpretations of the qualities to be graded tend to produce large 
group differences. Under Type A, weather conditions obviously 
affect the rifle-shooting scores of groups of men who fire on 
different days; accident rates depend on the exposure each group 
has undergone, and so on. Now selection procedure only attempts 
to predict Nos. 1 and 2 on our list, and to take No. 3 into account. 
It is essential therefore to keep Nos. 4 to 8 as constant as possible 
in all the groups followed up, if the true agreement between the 
predictions and the results is to be obtained, or else, as already 
indicated, the investigation must be extended to several groups and 
the consistency of the findings analysed. In collecting follow-up 
data as full information as possible must be obtained under head- 
ings 3 to 8. When these factors operate randomly, neither helping 
nor hindering groups of initially good or poor quality, and when 
they produce variations of not more than about 10 per cent. in the 
marks, then it can be proved that their effect is merely to reduce 
the validity coefficients slightly (cf. McClelland, 1942). But more 
often they operate selectively, for example, the initially poorer 
groups get poorer training, or alternatively they are provided with 
better instructors. This greatly distorts the correlations, occasion- 
ally raising them spuriously, more often reducing them consider- 
ably. Similarly when an officer or instructor assesses the proficiency 
of his men more highly than does a second judge rating a second. 
group, it may be difficult to tell how far the first group is really 
superior in quality, how far the difference is due to No. 6 on our 
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list, and there is usually no means of allowing for its effect on the 
Correlations*. 
3 Selectivity 
Psychology resembles the other sciences in that the only sound 
way it has of determining the effect of a certain condition is to 
observe the results of applying that condition, all other factors 
being kept constant. Thus the ideal method of validating a voca- 
tional procedure is to compare the results in a group subjected to 
this procedure with the results obtained in another group precisely 
Similar in all respects except for the absence of the procedure. This 
often raises insuperable practical difficulties. One may compare 
output or examination results before and after the introduction of 
à procedure, but almost always the training given to the second 
Broup, or the job itself, will also have altered, or other changes n 
tne examiners, the morale of the workers, etc., will have occurred. 
f gradings are used, they will depend more on the graders 
Opinions of the procedure than on the efficiency of the men. An 
employer's satisfaction or dissatisfaction with the men sent him at 
ferent periods does not Constitute scientific evidence. Every 
efort should be made then to follow up simultaneously a group 
$a bjected to the procedure and a control group; and when к. 
he judgment enters into the criterion, the examiners oF 8 
ould not know which men belong to which group. А 
Naturally, however, neither the Services, nor sip 
education authorities, can usually afford to accept such b 7 
trainees, when it is certain that a large proportion of them will fa? 
?nd the psychologist has to make do with a very unsatisfactory 
alternative. In effect he grades his candidates, at the time of а. 
Ment, into more and less suitable groups and subsequently vA. 
Whether the former are superior to the latter 00 the criterion. ТШ 
* is forced to deal with selected, and unduly hamogene ol 
8toups. He can study the value of, say, tests for secondary i E 
Suus or mechanics only by following up. the ci US 
t €chanics who were originally chosen largely on the К EI 
ests, Not only are the differences between his more an 


: etween all 

able Broups likely to be smaller than the differences b 
i ults from 
M McClelland does, indeed, provide a method of m oun is applic- 
able tous groups which brings them to a common standard, is- 
only under conditions which never operated in t 


еп а 
ability. some test taken by all the groups which corre?" 
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his selectees and a control group—hence his validity correlations 
are greatly reduced—but also all other correlations tend to be 
distorted to a varying extent by this selectivity. Suppose, for 
example, he selects men on the basis of previous education, but 
applies selection tests with a view to choosing the best tests and 
substituting them later for the education: he is almost sure to find 
the tests agreeing better with proficiency than does education, and 
yet their agreement will certainly be poorer than it would have 
been had the men been entirely unselected. In other words, any 
test or selection procedure not used for selection appears to give 
better predictions than procedures which have been used, yet no 
procedure can display its full predictive value unless—a rather 
unlikely contingency—it is quite independent of, or uncorrelated 
with, the procedures actually employed. Again it follows that the 
-better the selection scheme already in operation, the poorer will its 
validity appear to be when a selected group is followed up. In one 
experiment in a new mechanics branch in the Navy, the first 300 
men sent for training were almost unselected since little was known 
about the job requirements. With the next 300 more information 
was available and standards were raised. With the next 500 there 
was still further improvement in the selection procedure. But the 
correlation coefficients between T2 and course results in these 
three groups were -69, -53 and :38 respectively, and all the standard 
naval tests showed similar declines in validity (cf. Vernon, 
1946b). 

There are, it is true, statistical formulae which can theoretically 
correct for this selectivity and give the correlations that should 
have been obtained in an unselected group (Burt, 1943b). ‘Their 
weakness is, however, that the precise basis of selection is seldom 
known. Some psychologists or P.S.O.s may have relied largely on 
test Scores, others on previous education or experience, others on 
judgments of temperament and interests. The formulae only wor k 
when all the candidates have been chosen on a particular test ОГ 
battery of tests, so that the exact amount of homogeneity (restric- 
tion in range of ability) can be determined*. It often happens, for’ 
example, that Р.5.0.5 allocate to a certain job men with low test 
scores, but with compensating qualities of keenness or relevant 
experience, and when these men do well at the job, they reduce the 


^ P р 
They assume too that all the variables involved give normal distributions of 
scores; this is not necessarily true of all selection tests and is usually untrue 0: 


such factors as experience. 
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size of the А 

method of E uh between the tests and the criterion. No 
In conclusio ae adequate in this situation. Е 
to being an ae it may be seen that validation only approximates 
results are d Deum In most practical situations the validatory 
Di iure of statisti y so many uncontrolled influences that a large 
clusions. O 2 т judgment is needed in arriving at sound con- 
Dot be the fault ke en these results seem disappointing this may 
also when the of the psychologist or of his methods. Sometimes 
with E rather favourable they need to be accepted 
heterogeneous e samples may have been too small, or unduly 
that, as the $, or other defects may be present. We must hope 
understood tee outlined above come to be more widely 
will yield m will be possible to plan better experiments which 

ore clear-cut answers to validation problems. 


Previous Validatory Studies 


It is di 
Jt 1s difficu л 
Scientific ус Io find any good instances in the literature of 
ation of vocational selection procedures which 
des tests, though there 


employ i i 
аге EE xs and other techniques besi 
introducin s claims, not very well supported, a$ to the benefits of 
and С ЗЕ Psychologica! methods into industrial organisations, 
and, havin, = * studies of tests. Vocational guidance, on the other 
psychologists, ies conducted largely by academically traine 
useful su E always laid more stress оп thorough validation. 
ry of British investigations to date is given by 


a (1943). 
the р 
cases who had received full-scale individual guidance 
five years later. 


from 

the - 

"rom hb 463 were followed up three to 

о a questionnaire Or interview, the numbers who 
judged. Of the 


Were © 

45 ас pns and happy in their work” were J 
mendations he which accorded with the Institute's recom- 
Non-accord: Бе as were successful, whereas of the 118 in 
Sidering th цаз jobs only 57 per cent. were successful. Con- 
to be ies individuals seeking guidance from the N.LLP. tend 
tain, the ae cases, whose vocational suitability is very uncer- 
| itions ma js cent. success is highly grati ing. 90 many con- 
Could sca; y alter in three to five years that a larger percentage 
IN скы be expected. É 
led JA cud which conform more 
ification, rather than guidance 


closely to what we have 
or selection, are those 
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of Rodger (1937) in a Borstal institution and of Hunt and Smith 
(1945) at Birmingham. Rodger compared assessments by instruc- 
tors of the efficiency and progress of 200 youths allocated to work 
parties in the usual manner by their housemaster, and 200 allo- 
cated by himself on the basis of tests and short interviews. Of the 
former 45-6 per cent. and of the latter 69:5 per cent. were given 
Grade A reports. While the Superiority of the psychologist’s cases 
is not very striking, it is probably reduced by the unreliability of 
the instructors’ gradings, and other factors. 
In the largest Birmingham enquiry, 1,639 children were followed 
up over two years and 603 over four years, roughly half being boys, 
half girls. Half of these had been given guidance by psychologically 
/ trained teachers on leaving school, the rest—a control group—had 
only received advice at ordinary employment conferences. Numer- 


ous criteria were employed, includin 


g employers’ gradings and 


length of tenure of posts, all of which gave favourable results. T. he 
most interesting figures are listed in Table I. Among the “guided 

- cases, 90 to 93 per cent. of those in recommended jobs state that they 
are satisfied, and only 26 to 33 per cent. of those in “non-accord- 
ance" jobs. Of the controls, however, the cases who followed the 
employment conference’s advice are if anythin; 
those who did not. Many more guided cases retain accordance 
than non-accordance jobs over a long period, but the controls 
show practically na difference, Finally, it is noteworthy that а 
time passes а larger Proportion of guided cases are found in 


accordance posts ; that is they show a 
which is absent in the less skilfully ad 


g less satisfied than ` 


MS 
"drift to recommended job, 
vised controls. 


TABLE I.—RzsurTs OF BIRMINGHAM VOCATIONAL GUIDANCE EXPERIMENT 


Psychological: 
Guided are Control Group 
ne | — É — 
Non- Non- 
Accordance accordance | Accordance accordance 
jobs jobs jobs jobs 
8 ЕЕ nuca 
er cent. er cent. er cent. | per cent. 
Satisfied with job at end of ^ P á 
2 years (self-ratings) А 90 26 64 16 
Ditto at end of 4 years A 93 33 64 "78 
, Retained first job over 2 years 56 11 37 33 
Ditto, over 4 years 5 , 46 11 27 26 
In accordance jobs at end of 
2 years . 5 87 — 52 = 


Ditto, at end of 4 years s 92 == 47 = 
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St A 

ee emdi procedures in American schools and col- 
(1938) sum yet seem to be so well validated. Thus, Williamson 
engineerin sre several investigations in which arts and 
obtained : udents who were counselled by faculty members 
improbable cv college grades than matched controls, But it is 
training. In т these counsellors had received any psychological 
cent Ui i Я ater research (Williamson and Bordin, 1940), 80 per 
Ofa control ents who had received counselling and 66 per cent. 
well-adjuste eh were assessed, one year later, as emotionally 
significant Th ‘he difference is rather small, yet statistically 
guidance ihe "y have been numerous follow-up studies of child 
ing a satel few have taken the essential precaution of includ- 
maladjusted Ladies in order to discover what proportion of 
theless, C n. yw improve without receiving guidance. Never- 
different аач ogers’ book (1939) shows that the effectiveness of 
now Aim of treatment, e.g. boarding out in foster homes, is 
mention a ing scientific study. Burt and Lewis (1946) briefly 
different a Chae of analysis of variance to the evaluation of 
result of Ne i of treatment of delinquents and neurotics, one 
is more in ae seemed to be that the personality of the therapist 
Psychologi uential than the type of treatment he adopts or the 
Sting ines theory on which it is based. Т ypical of many inter- 
ing on th estigations of the effects of different systems of upbring- 
and B e personalities of the children is that of Baldwin, Kalhorn 

reese (1945), proving that a democratic rather than auto- 


uc A . 

rn ре in the home produces greater intelligence, 

j ity, curiosity and tenacity, and that over-indulgent Or 
have unfortunate effects. 

à 


Teje c 
Jectant attitudes among the parents 
The y. Р 
alue of Personnel Selection Procedures 25 a Whole in the 
Info Л Forces 
rmation on the failure rates of practically all recruits trained 


for d; 
E Henn: A.T.S. jobs in 1944 was collected, and it was con- 
that 94 per cent. had been successfully allocated by Р.5.0.5 
overing over 8,000 


toj 
vm ay could manage. A similar enquiry covert! 4 
Tates кс in several representative , yielded failure 
Cor titus Dus less than 5 per cent. Such figures however, do not 
Posted be evidence, since the equivalent failure rates for recruits 
quali efore D.S P. was established are not known. Also the 
ty of recruits may have risen or the standards of the training 
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schools may have been lowered. If they show anything at all it is 
that the Army and A.T.S. authorities were mostly converted, by 
1944, to approval of personnel selection. 

Much more convincing was a comparison of several groups of 
auxiliaries selected by P.S.O.s with other groups selected by other 
methods and trained simultaneously. Their respective failure rates, 
given in Table II, show great improvement attributable to per- 


TABLE IL—FarLUnE Rates or A.T.S. TRAINEES SELECTED BY DIFFERENT 


Мвтнорѕ 
Мо. Мо. 
Category Selected by | Per cent. | Selected by | Per cent. 
Old Methods| Failed P.S.O.s Failed 
Drivers 124 30 1,004 14 
Clerks 128 11 592 4 
Special Operators 420 60 130 7 
Operators, Wireless and 
Line 217 7 187 05 


sonnel selection. Special operators were particularly striking, since 
improvements in selection procedure were introduced in several 
stages, and with each alteration there was a large drop in the 
failure rate. 

War Office and Admiralty psychologists were especially inter- 
ested in the selection of tradesmen and mechanics, and collected 
some of their most striking evidence in this field. Hotoph compared 
the failure rates on training courses of some 10,000 Army trades- 
men selected by four different procedures during four months of 
1942, with the results listed in Table IIT. 

Apparently the P.S.O.s’ selectees had only about half the overall 
failure rate of men selected by ordinary Army methods and a third 
that of men selected by the Ministry of Labour. However; the 
comparison is somewhat unfair in that more of the third than of the 


TABLE IlL—FarLURE RATES or ARMY TRADESMEN SELECTED BY DIFFERENT 
METHODS 


“Adjusted” 
Failure 
Rate 


Overall 
Failure 
Rate 


Method 4 No. 


—————_: 


Nominated by С.О. or technical officers 3,566 
Nominated at own request - | 3,176 
Called up by Ministry of Labour as 
semi-qualified tradesmen : > 911 
Selected by P.S.O.s 2 E -| 2,201 


VOCATIONAL CLASSIFICATION PROCEDURES 121 


fourth 
"Dno E m to go to trades where failure rates were 
equalise the к. (à attempt was made to adjust for this and to 
given in the ms ers of each group in each trade, with the results 
are still the b column. It will be seen that the P.S.0.s’ selectees 
Ministry = Dm though less strikingly so, and that Army and 
be ебе ре cases achieve much the same rates. It should 
industrial i that Army P.S.O.s had little psychological or 
trade tests or ced and did not, at that time, use any standardised 
their success ests of trade knowledge. Hence the main reasons for 
candidates? must lie in the careful checking, at interview, of the 
who failed се ехрегіепсе, and in the rejec 
tests, о get fairly high scores on intelligence 
In 7 > 
а Fleet Air Arm over 16,000 mechanics and fitters were 
up. Over 6,000 were selected by: standard methods before 


Psychologi 
Course E: ists played any part, and their overall failure rate on 
as 14-7 per cent. When selection was undertaken by 


'S.0.8, worki 

Tate a E mind under a psychologist, not only did the failure 

Much lar ver ШЕП drop to 4-7 per cent., but also they extracted a 

Without d proportion of trainees from the available naval recruits 
denuding other mechanical branches which were likewise 


Makin 
ment сеге demands at that time. Table IV shows that improve- 
S obtained in every category. While the comparability of 
annot 


ens 
rain B . D H 
S ensured, courses and examinations in the two periods ¢ 
, careful enquiry revealed no grounds for supposing that 


Tap 
LE IV. p ў 
У._Рлповв Rates or Freer Am ARM MECHANICS AND FITTERS 


Psychological 


tion of candidates 
and other group 


Old Method усло nod 
Category 
No. of M No. of 
men put |Per cent. | men put Per cent. 
Ar —— on course Failed | оп course Failed 
Mech DE 2,234 69 
ec] rames E " 2: Я 
XE prins eh. no na 2,218 50 
Ordnance —. || 902 73 | 1,596 н 
xs. ЕЗЕР el 332 | 161 1069 | 49 
lr | Ce! rae 
ш Агаш z3 [V 10: 723 "m 
NE OE 959 | 158 из | 54 
Ordnance: 130 131 199 2:0 
-u Бшш. 2 2| 270 92 326 | 1% 
эшк cocus е 890 
: Ё Ё : A 6,530 147 10,008 47 
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there was any relaxation of standards during the period when 
psychologists took over. It has been calculated that this reduction 
in training wastage resulted in a financial saving of fully £100,000 
per annum. 

A point of interest which never seems to have been investigated, 
except at W.O.S.B.s, is the reliability of vocational guidance or 
selection procedures as a whole. If the same group was tested 
and interviewed by two psychologists how closely would their 
recommendations agree? However, reliability is of little impor- 
tance provided that validity is adequate. 


Validation of War Office Selection Board Procedures 

Fitts (1946) points out that there is no scientifically acceptable 

evidence whatever as to the value of the German officer selection 
procedures. A great deal of work was done in this field by the 
British Army Research and Training Centre, and this will be 
reviewed here since it was concerned chiefly with the value of the 
selection as a whole rather than with that of the separate tests OI 
separate board members. If its results are somewhat disappointing, 
this may be attributed largely to the following difficulties: 

(1) Owing to the lack of technical control, different boards used 
different methods and had different standards. It was hardly 
feasible, therefore, to treat candidates graded high, medium 

_ or borderline from different boards as equivalent. Again the 
methods and the recording systems changed at intervals 
during the war, Pass rates also varied markedly with opera- 
tional requirements. i 

(2) The “selectivity” of the population was so high that even 
highly valid procedures would be expected to achieve only 
moderate or low coefficients. Only men regarded as promis- 
ing by P.S.O.s or by their C.O.s reached W.O.S.B.s at all, 
that is barely 10 per cent. of the Army population. Some 
60 or 80 per cent. of these candidates were rejected by the 
boards, and it was, of course, impossible for rejects to be 
sent forward in order to prove what bad officers they would 
make. Another 5 to 20 per cent. of accepted cadets failed in 
their O.C.T.U. training, and during subsequent field train- 
Ing an unknown proportion of the less suitable got shunted 
off into relatively unimportant jobs. Thus, only the final 
cream, amounting perhaps, to 50 per cent. of W.O.S.B. 
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acceptances or about 15 
a мї 16 per cent SEW ЭШИ ШЫ 
se 
where st асу оа board often go to diffe 
io о differ and where, Med E oe Е 
follow-up P are shifted away to new jobs before xt 
to collect an dd can catch them. Thus it is impossible 
trained and 1 (n who have both been selected. 
this:scatterin ssessed uniformly. At the operational stage 
IO, гелеве А ет. naturally, even more marked. Only one or 
trained by PT ably passed by different W.O.S.B.s and 
likely to b ifterent O.C.T.U.s at different eriods, аг 
(4) While inf found in any one unit. PUN 
осту ае is rather more easily collected from 
Whether a ca pue from the field, it is extremely dou tful 
his ultimate a s.0.C.T.U. grades correspond closely with 
(cf. Jenkins pn as an officer. An American investigation 
between. of 47) gave a correlation of only + 18 X 075 
follow-up ше с and subsequent combat ratings“. 
thirteen mo xs officers in various British units, four to 
been! din ia s after commissioning, i.e. before they had 
between О TA yielded correlations averaging + " 
Although 28 iat passing-out grades and C.O.s' opinions. 
reasonably a eee was eventually evolved for securing 
their gradin iable judgments from С.О.з it was fou that 
and the жы. greatly affected by the subalterns’ age, 
that they v eth ottime they had been commissioned, an 
therefore. а in different Arms. For accurate validation, 
these Eire desirable to secure large groups in which all 
to. were held constant—@ practically impossible 
Was P eee, comparison of old boa 
pin type e m by the fact that, during 19 
SER геге]. Bait simultaneously before-the old ones Were 
in OW-up office . Ratings were collected on some 1,200 cadets by 
Structors (wh ELSE held conferences with the T.U. 
vho did not know which cadets came from which 
scale with the 


(5 


= 


rd and new board cadets . 
42, several boards of 


oard), 'T 
тезш Ве gradin : 
ША shown in ‘Table V. reduced to a three-point 
of OS Dot clear e v. 6 
how thi 
ow this figure was reached. From the published tabulations 
ombat ratings, the present 


Writ, cer Sch 
ег: chool : : 
5 obtain a eade D ratings and subsequent с 
c correlation of “32. 


Ы 
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TABLE V.—Tramine RESULTS OF OFFICER CADETS SELECTED BY TRADITIONAL 
AND BY PsycHOLOGICAL METHODS 


Above Average Below Average 
Board No. Grades 8 Average and Fail 
per cent. per cent. per cent. 
Old . . А 491 22:1 41:3 36:6 
New , „| 391 345 40:3 252 


The selectees from seven out of the eight new boards were 
superior to those from all five of the old boards, and this superiority 
was found (though not always to a statistically significant extent) 
in all ten O.C.T.U.s, representing different Arms. Actually the 
result is less favourable than it looks. A true correlation cannot be 
calculated, but it is unlikely that the figures given in Table Vill 
correspond to a higher coefficient than about + -2. Nevertheless, 
the combination of slightly more valid selection procedures with 
greater attractiveness to candidates resulted in the sending of two- 
and-a-half times as many above-average cadets to O.C.T.U.s as the 
old boards would have done, within five months after the establish- 
ment of new boards. s 

The follow-up of 329 officers mentioned above, some of whom 
were old and some new board products, yielded almost identical 
results at O.C.T.U. Any differences between the two groups on 
C.O.s' ratings in their units were too greatly masked by age, Arm; 
and length of commissioning to be calculable. A small-scale follow- 
up of thirty-six R.A.F. candidates for administrative commissions 
gave promising results. These men were both interviewed by 2 
R.A.F. interview board and put through the W.O.S.B. procedure. 
Their grades at the end of eight weeks’ O.C.T.U. are said to have 
correlated fairly closely with the W.O.S.B. predictions (how closely 
„Was not published), not at all with the interview board predictions. 

Several later enquiries in the field showed that C.O.s were wel 
satisfied on the whole with W.O.S.B. products, only some 7-13 
per cent. being considered unsatisfactory either in this country, Шш 
the Mediterranean theatre, or in the Army of the Rhine. Thus, 1? 
spite of the continuous drainage of the best material, W.O.S.B.s 
appear to have maintained the flow into the sixth year of the War 
But it was clear that C.O.g standards were to some extent 
dependent on what they received, ie. that their borderline of 
unsatisfactoriness may have dropped. As no control group © 
subalterns not sent through W.O.S.B. was available, these figures 
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prove little E d 
Sup. A сасе were obtained in an A.T.S. officer 
obtained imate UT of + -28 (uncorrected for selectivity) was 
Те р ATS; W.O.S.É. and О.С.Т.О. gradings, but 
their units id W.O.S.B. with assessments of ATS: officers in 
In 1945, а ough positive, was very small. 

Artillery атт иче; of over 500 officers in Infantry and Royal 
Rhine, and d тт from C.O.s just before the crossing of the 
found E жаты but significant differences in efficiency were 
at one or anothe ae who had been passed as As or Bs, Cs and Ds 
Correlation bet E sixteen W.O.S.B.s. For all groups combined the 
+ -165, but if ween the two sets of gradings amounts to about 
Belexpected ad d WA is made for selectivity the coefficient to 
about je ae all W.O.S.B. candidates gone forward) rises to 
distinctly ieu t was also found that W.O.S.B. predictions were 
Were E dnt = younger than for older men. Among those who 
28, and am at 23 years or under the uncorrected correlation Was 
Clearly the d those who were 28 or over it dropped to 06. 

experience—, :0.5.В.5 had attached insufficient weight to age an 
Ing the CM ашан which C.O.s particularly valued. Consider- 
Oards, i с ility of the criterion and the variations between 

„In the gures are fairly satisfactory. 

Sible, DANS. period, operational follow- 
.O.S.B а large number of investigations have 
Under n give fair predictions of success at O.C 77 
Teach the ЖЕ, controlled conditions, correlations consistently 
Skill of the ‘4 to + °6 level, though they depend greatly on the 
the O,C.T а. board members, and оп the thoroughness of 
U. assessments. 


A fund 
bility, een weakness of W.O.S.B. selection was its unrelia- 
о the large part played by uncontrolled subjective 
dates were put throug 


Judgm 

Wo heat In one experiment 116 candi 

Sented by я t fortnight apart, and the average agreement was repre- 
Y any one cu achoric correlation of only ‘67- Of candidates passe 

another bo oard, 214 per cent. were rejected ог deferred by 

Опе pair x Evidence was collected as to the agreeme 

Pair using hi cards using closely comparable methods, and another 

ence ber ighly "individual" procedures, also on the correspond- 

not published ше gradings of different board members. This was 

among th ed, but it is believed to have shown higher reliability 

е technical than the non-technical members. 


up has been impos- 
proved that 
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In 1945, however, an elaborate experiment, where two b 
composed of highly experienced staff observed or interviewe E 
same 125 candidates, showed quite high reliability. Morcover, б 3 
agreement was at least as good when all members worked A 
pendently as when there was collaboration within, or un | 
teams. Both M.T.O.s and psychologists observed the leader ad 
group and other practical tests. Psychiatrists interviewed T 
pendently, but presidents sat in on each others' interviews. Ta E. 
VI shows the average agreement as to the general suitability A 
candidates in the first column. The coefficients for pun * 
observers are high, for interviewers rather lower, and for who 
teams the final coefficient of -80 is reasonably good. - 

In addition to general suitability judgments, ratings were m 
on fourteen to eighteen carefully defined traits and the me id 
reliability coefficients in the second column show a similar tren Д 
The last column gives the average correlation between each p 
of member and the final gradings of the candidates obtained a t3 
-conference of both teams under an independent chairman. х : 
should be noted that all the figures quoted are averages based i 
several members, and that the separate coefficients for particu : 
members often showed wide variations. Unfortunately, 1t'Was E 
possible to follow up these candidates and so to validate | 
various sets of judgments. ' M". 

The conclusion that appears to follow from this review i 
W.O.S.B. methods, applied haphazardly according to the wh! 


.5.В. 
TABLE VI.—RELIABILITY оғ Jupcments nv Mempers or Two W.O 


EAMS 
| Median Correlation 
Relay Agreement with ЖЫ 
t isposi 
Coefficients | °” dis d Es 
M.T.O. with M.T.O. "ne -86 UT d 
Psychologist with Psychologist | "78 69 
М.Т.О. with Psychologist . — . "19 — r 
75 
President with President. — . 65 68 ur 
Psychiatrist with Psychiatrist a *65 “47 
President with Psychiatrist . d *62 = T 
М.Т.О. or Psychologist with Presi- = 
dent or Psychiatrist А 2 "59 == ` 
AN -91 
Team with Team . ‘ Е “80 “68 ? 
s == == erc M NN Ы Se c 


ment. 
Ment. Indeed, many psychologists wou 
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of 
a ix ie] of slight value, but that when standard tech- 
E 65 ved and applied uniformly by trained and experi- 
Only p a satisfactory reliability may be obtained. 
an ан conditions сап good validity be expected, 
oe a more, admittedly difficult, research is needed 
E of th e value of the contributions of the various 
the procedure. A number of such investigations 


are n 
ow under way. ’ 
of Correlation Coefficients 


It is v й 
a E difficult to convey the practical meaning of a correla- 
cient, say of + +5. It does not imply 80 per cent. agree- 
ac ld say that it represents an 
ге еда only 13} per cent. better than pure chance. 
that we n e of the figure, 134 per cent., 15 as follows. Suppose 
lying at ы the actual output on the job of an employee as 
diction mi aged for all employees, our average error of pre- 
from the 16 t be, say, 100 units. If, however, We predicted output 
as 861 mg s selection test score, our error would still be as big 
Бесс s, on the average, hence the reduction would only be 
D аныр It is more enlightening, however, to consider what 
Certaj. = of good or poor men are selected by a test with a 
ak validity coefficient. 
Unselected an example some job which only о 
1,000 E can perform satisfactorily. I 
Orty of hi idates to choose from, and takes 200 at rand 
Or tests с es will be satisfactory. Whereas if he applies a test, 
With the ar a validity correlation of + :5, and chooses the 200 тепе 
many) wif, test scores, eighty-eight of them (or more than twice 25 
Ment, but S satisfactory. This seems to representa great improve- 
candidate’ the following Table VII shows that, from the individual 
Selection - S standpoint, the tests are not very successful. When 
ave m was random 160 were wrongly rejected, since they could 
и the job, and 160 who could not do it were үш 
CEN The total wastage is, therefore, 390 or 32 per cent. Y. en 
reduction are used the total wastage falls 1o 224, that is à 
E: Tm by only 30 per cent. Table ҮШ. similarly ш 
Quite age when the validity of the tests is very ow, E 
1$ the igh. Not until the validity co ches + 74 
wastage halved, 


Note on the Meaning 


ne out of every five 
f an employer has 
ndom, only 


rrelation rea 
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TABLE VII.—NUM»ERS or SATISFACTORY AND UNSATISFACTORY SELECTEES 
A. Selected at Random 


Satis- | Unsatis- ) Sois- | wnsatis- 


2 аети 
Гати Плот J Гали fc oe 
| a fab (aa Fob et fab, | at fae 


Selected 40 | 160 200 Passed E | 112 200 
je | 640 800 a ae | 
Rejected | 160 | | nd 


| ess | 800 
Total | 200 | воо | 1000 tests 112 
————— | 


B. Selected by Test Battery 


"Total 200 800 | 1000 


3 TES 
Taste VITI.—Wasrace WrrH Tests Havinc Low AND HiGH VALIDIT! 


A. Test Validity .. + +25 B. Test Validity .. + "74 __ 
Satis- | Unsatis- Satis- | Unsatis- 

| factory | factory factory | factory ae 
Passed Passed 200 
tests 64 136 | 200 tests 120 80 | aes 
Failed Failed j 800 
tests 136 664 800 tests | 80 720 ine 
EX z 00 

Total 200 800 1000 Total | 200 800 10 


Unfortunately these Tables would not be correct if the кч 
ratio was altered, unless it became 4/5 instead of 1/5. If 80 per cent. 


of men could do the job, Table VII A would become Table IX, 
and all the others would be similarly reversed. 


° "TABLE IX.—WasraAGE WITH Нісн SELECTION RATIO 


Satis- | Unsatis- 
factory factory 


Selected 


Rejected 
"Total 


But with a selection ration nearer to 3, the wastage would be 
greater, and the more it diver; 


ges from } the more favourable me 
improvement produced by using selection tests. This can readily 
be seen from Fig. 1, which 


gives a graphical presentation of i 
correlation of about +5. Each dot represents one man’s tes 
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score and his subsequent proficiency. When the selection ratio 15%, 
the quadrant A represents men correctly selected by the tests, the 
quadrant C men correctly rejected, and the wastage, shown by the 
numbers in B and D, is fairly high. But when the ratio is very 


Good ! Poor 
Gi 
SUCCESS AT JOB SUCCESS AT IUOS 
lo. lb. 


Fic. 1, ScaTrERGRAMS OF CORRELATION COEFFICIENTS WITH SELECTION RATIOS 
OF 50 PER CENT. AND 20 PER CENT. 


high or low as in Fig. 1b, it may be seen that the wrong acceptances, 
B, and wrong rejects, D, are relatively few. 

Nevertheless, the percentage reduction in wastage does remain 
fairly constant so long as the selection ratio is not greater than 
84 per cent. or smaller than 16 per cent., and this covers the great 
majority: of vocational and educational situations. Fig. 2 is 
borrowed from McClelland (1942). Let the ratio be h per cent., 
then the wastage when no tests are used, or when their validity is 
represented by a correlation of zero, is 2h(100 —h). 


100 
For example, if the ratio is 1 in 5 or 20 per cent., or alternatively 
._ 2 X80 x20 
80 per cent., the wastage 1s Жү А $2 per cent., as shown 


in our Table VII A. For validities of -25, `50 and -74 the graph 
shows that the number of mistakes amounts to 85 per cent., 70 per 
cent. and 50 per cent. respectively of this 32 per cent., so giving 
the numbers listed in Tables VII B and VIII. This same reduction 
would occur with other selection ratios in the middle range. 

So far we have assumed that as many men are rejected or 
selected by the test as the proportion found unsatisfactory or 
satisfactory on the job. Naturally there is no need for this restric- 
tion. Although it is the commonest and most convenient plan, 
McClelland shows that, by an appropriate choice of pass mark on 


Р.5.—5 
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he tests, the wastage can be further reduced. In fact, for each 
election ratio and each validity coefficient, there is a certain test 


100 
IE ҮТ T 
90 | ШЕ. i H | [ | | H 
H | Coo 
13 80 ] 
E: ASEH 
E 70 zi 1 
= 60 Ed | 
5 50 I cH | 
e 40 Li 1818! | 
со 
= 30 
2 
S20 
10 
0 
Wy а Р УЕ aS Ea ӨШЕТЕП. 1 
SIZE OF CORRELATION 
^ 
Fic. 2. PERCENTAGE OF WasTAGE REMAINING WITH DIFFERENT SIZES OF І 


CORRELATION. ї 
j 
| 
| 
| 


pass mark, ‘readily determinable statistically, which produces 
minimum wastage. But to portray this situation would complicate 
matters, and the reader may be referred to the full Tables provided 
by Taylor and Russell (1939), and Tiffin (1946). 


СНАРТЕВ УШ 
THE BIOGRAPHICAL QUESTIONNAIRE 


Abstract—A description is given of the Army qualification form 
and the naval biographical questionnaire, and attention is drawn 
to precautions in the construction and use of such documents. / 
: Occupational experience, education, age, interests and other 
items recorded on questionnaires are shown to be significantly 
related to success in a variety of Service jobs. Examples are drawn 
from investigations of telegraphists, motor drivers, seamen, radio 
mechanics, etc. But there are considerable technical difficulties in 
turning such information into an objective form in order to yield 
direct predictions of occupational suitability. Single questionnaire 
items are very unreliable, especially among low-grade recruits, and 
they overlap in a very complex manner with one another and with 


intelligence. 
CUNETA MUMA 


Typical of the questionnaires used in the Forces is the Army 
qualification form, reproduced on pages 132 and 133. As may be 
seen it covers Service particulars, previous education and employ- 
ment, experience that might be relevant to Army jobs, results of 
psychological tests and medical examination, and the P.S.O.s* 
conclusions. The questionnaire used in the Navy had additional 
sections on school and leisure time interests and on leadership 
experience (captain of a team, secretary of a club, being in charge 
of people, membership of pre-Service organisations, etc.). Though 
the questions are printed, they were read out by the tester to 
groups of recruits and further explanations were given when 
necessary. Incidentally, the front pages of these forms served as a 
useful and adequate, although unstandardised, test of literacy. In 
the case of the 1-2 per cent. of illiterates who could not produce 
an intelligible set of answers even under supervisions, the tester 
in charge would fill it in for them. 

Now even the most straightforward questions may be misread 
or misinterpreted, especially by duller recruits. The following list 
of precautions is based partly on experience and experiments in the 

131 
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S.P. Sheet 100A (July 43) QUALIFICATION FORM 


A. IDENTIFICATION PARTICULARS 
1.—P.T.C/W. No. 
3.—Surname.. 
4.—Initlale 

—Date of Intake 
9.—Army Trade, if any 

10,— Nationality ... 
B. EDUCATION 
13.—Age on leaving last full-time school... 
15.—Matriculation or School Certificate. 
17.—H.8.C. Subjects... 
18.—University.. у чай. үй 
31.—Subjecta taken at professional o courses, Including technical courses and evening clase 


CONFIDENTIAL 


Do NOT write 


2.—Army No. In this margin. ' 


Arm 
n. 
LR. 


10a.—County Connection... 


serves 14--Standard/Class.., 13. ЕВ... 


No. of 
Bubject Saa Where Taken 


C EMPLOYMENT 22.— 


Employer Your Job 


23.—Which of these has been your main Job ?.. 
24.—What exactly did you do 1... 


25.—What fob do you Intend to do after the war ?......, 


27,—Underline whichever of these you can drive :— 


Car Motor-cycle PSV. Heavy Goods Motor Boat 
28.—How long have you held a driving licence? From1 


nee to 1! 
29.—Underline the type of vehicle for which you have Passed a 
Car 


driving tes 
Motor-cyclo Р.8.У. Heavy Goods 
— lerline any of the fol 
онша any ofthe е following of which you have had cient elther in your Job orin 
(а) Metalwork (0 Carrepalra Ф) Typing (D Cooking 
Ф) Woodwork (7) Using precision tools (J) Shorthand (m) Morse Code 
(е) Electrical repairs (9) Handling boats 


(d) Badio repaira 


31.—Previous experience in Н.М. 
Corps :— 


(k) First Ata (n) Photography 


Forces (Incl. Home Guard), Civil Defence, Training or Cadet 


Approx. Dates Nature of Duties 


32.—Were you a member of any boya’ organisation ? 


Which RR 


ае 


Front page of Army Qualification Form. 
132 
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B.P. SHEET 100A, (July 43) CONFIDENTIAL 
Е. POSTING DETAILS 
(2) 
Army Number 


@ 
Nature of 


(6) 
Date of Intake 


Infantry 
Regt. Connection 


a9 
ing Recommendation 


3t | Becond | Third. 


Matrix Bennet | Arith. | Verbal | Instr. 


G. INTERVIEW RECORD 
Main occupation... 


——={ ш 


от. 1 2 3 Potential Tradesinan. ce eere 
Yes] 
n Wears Glasses} XC! }* «Cross out whichever does not apply, 


AOTION ADVISED BY АР. 


P.8, 0," Bignaturts,. neret renes | Data Bigne 


7. RECOMMENDATIONS 
Posted with Training Recdn.. КОКО ЕЕЕ УУ с э —— cantet 


Suggested as. 
Dai 


Р.5.0 'з Signatur: 


Norz,—This form will be sent :— x 
(a) by P.T.O/W. to the War Office (M.P.3) on 16th day after Recruit Joins. 
(8) by the War Office (M.P.3) to P.T.C/W, with posting Instructions. 
(©) by P.T.C/W. to Corps Training Centre to arrive 48 hours before Recrult. 
(d) by Corps Training Centre to Service Unit attached to A.F. B122 


К. PROGRESS RATING ice Unit and foi 
‘This portion to be detached, by cutting along line, by Service Орта 
GPs) ‘one month after the man's arrival in the Service Unit.) RING round the appropriate figure. 


Primary Training | Согра Training 
Name of Unit Efficiency Rating. 
Duties In 
whichtrained General . 1 = Se a 
Efficlency 1 2 
Xxx to be | Signatare 
marked x 


Back page of Army Qualification Form. 
133 
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Forces, partly on the large published literature dealing with the 
questionnaire (cf. Koos, 1928; Symonds, 1931; Goodenough and 
Anderson, 1931; Good, Barr and Scates, 1936; Vernon, 1938a, 
1939b): 


(1) 


(2) 


(3) 


(4) 


The design and layout of the questionnaire should make it 
easy to follow. Bad design may lead to omissions of questions 
and to cramping of the written responses. 4 
The instructions should be clear and simple, couched in 
words of a suitable vocabulary level. Questions should be 
unambiguous and, as far as possible, concrete and specific. 
A preliminary trial with a moderate-sized group of inform- 
ants is extremely useful for showing up obscurities. 

If selective or choice-response questions are to be included, 
a trial with open questions will reveal responses which might 
not have occurred to the psychologist. Too few alternative 
answers are obviously undesirable, but too many are liable 
to confuse. Actually, owing to the danger of misunderstand- 
ings, it is usually better to avoid choice-response questions, 


for example, listing various types of post-primary schooling 
and asking recruits to tick the ones th 
a form is certainl 


subject to statistical analysis, 


variety of human histories. 
say, into the predictive value 
erse responses can readily be 
The only background ques- 
was coded was that used by 
rmants were, of course, above 


right” or "wrong" 
duce any inclination 
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to falsify. Again, emotional terms or prestige names which 
may touch off prejudices or stereotypes are undesirable. If 
questionnaire tests or inventories dealing with attitudes or 
personal ‘traits are to be applied (cf. Chapter XV), they 
should be kept distinct from this essentially factual enquiry. 
In general the motivation of the informants requires careful 
consideration. There should, in other words, be strong 
reasons why they should take the trouble to fill up the 


questionnaire accurately. 


(6) It should be borne in mind that the questionnaire is in 
iven in writing, 


essence an interview, to which answers are gt 
and that it requires the same skilful formulation of questions, 
the same tact, as does an interview. While some informants 
are suspicious of this impersonal instrument and reveal 
themselves more willingly in the face-to-face interview 
situation, others are “bad interviewees” who may be more 


frank in a questionnaire. 


Validity of Biographical Items 

The objectively scored application blank has long been a popular 
technique among American industrial psychologists. The relevance 
of each item is determined empirically and a scoring key prepared 
for use with future applicants. This method was extended to air 
crew in the Royal Canadian Air Force,'to U.S. Army officers and 
other groups during the war, but was never applied systematically 
in this country. Nevertheless, the relation of questionnaire items to 
occupational success was worked out in some seventeen branches 
of the Royal Navy and Army, and the predictive value of particular 
items such as occupation, educational status, experience of driving 
or morse, age, etc., in a great many more jobs. It is not possible to 
summarise all these results, but some specimen investigations will 
be described below. First, however, certain difficulties should. be 
pointed out which are just as likely to arise in industrial or 
educational practice as in the Forces. 
(1) The validation of an item to which only two responses are 
possible requires much larger numbers of cases than the 
validation of a test or other extended variable. Roughly 
double as many are needed to give equally trustworthy 
correlations. Thus an investigation with less than about 400 

cases may be misleading since its indications of the relevance 
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(2) 
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of certain items might not be borne out in another similar 
group. This fact renders the study of previous occupation 
particularly difficult since only quite small numbers are 
likely to have belonged to any one occupation. Occupations 
must, therefore, be classified into very broad catégories, and 
in the absence of any generally accepted scheme, such classi- 
fication tends to be subjective and inconsistent. Significant 
differences between the proficiency of different occupational 
groups cannot be expected unless each group contains more 
than, say, 40 cases; but such groups may be attainable only if 
several heterogeneous occupations are combined. For 
instance, "clerks" might appear to be a straightforward 
group. Yet, in a study of naval officer cadets, accountants, 
Civil Servants, solicitors’ and Local Government clerks were 
excellent, while works clerks, costing, railway and shipping 
clerks were below average, and bank clerks and insurance 
clerks were intermediate (an average of 75 in each group). 
The name of the occupation may also be a very poor guide, 
especially with 18-year-olds. Thus, mate may mean any- 
thing from a highly skilled apprentice to a low-grade 
labourer. 

Items which show fair correlations with proficiency may do 
so merely because they involve intelligence, or because they 
overlap with other items. Taking occupation again, clerks 
were found to be superior in numerous naval branches, 
including some of the mechanical ones, but this was due 
almost entirely to their high intellectual level which made 
them excellent trainees for any job. When intelligence was 
held constant they were found, for example, to be poorer in 
active and fighting qualities in Coastal Forces, and to provide 
an excess of failures in certain mechanical jobs. Similarly the 

good results of those who preferred science or mathematics 

at school to geography, handwork, etc., mainly reflects the 

better schooling and intelligence of the former group. Even 

the advantages of membership of pre-Service organisations 

(А.Т.С., Scouts, etc.) was found to be largely due to the 

same factors. Thus, an item analysis of a questionnaire 

should at least determine the relation of each item to 

intelligence as well as to occupational proficiency, and partial 

correlation (or analysis of covariance) should be used to 
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indicate the independent contributions of each item*. 

(3) As already mentioned, reliance cannot be placed on item 
scores obtained from a single group. The scoring should be 
applied to a fresh group and its validity determined. 

(4) The answers given to many items are so unreliable that it is 
better to substitute tests whenever possible. Hence oral or 
written tests of trade knowledge were introduced, also prac- 
tical achievement tests in morse receiving, shorthand and 
typewriting. It was noted in the Navy that much more 
exaggerated claims to experience of morse were made at 
recruiting centres, where there was no likelihood of their 
being put to the test, than in H.M.S. Royal Arthur, where 
allocation to signaller or telegraphist branches was imminent. 
That genuine experience was, if anything, more valuable 
than good intelligence test scores was proved among telegra- 
phists, air gunners and other “communications” ratings. 
Nevertheless, examples were found of men claiming to 
receive at 8 words/min. who failed their morse training, and 
of men claiming 15 words/min. who required to be back- 
classed before they achieved the requisite accuracy at 20 
words/min. Experience of 4 or fewer words/min. was found 
tó be practically worthless, since morse is often so badly 
taught in cadet units as to handicap the early stages of naval 
morse training. It seems highly probable that carefully 
designed, even if brief, tests of interests would similarly 
have more diagnostic worth than merely ticking each of 
the activities in which recruits think themselves interested. 

(5) Selectivity interferes with the significance of questionnaire 
items in even more complicated fashion than it does with 
test scores. For example, when P.S.O.s have creamed the 
really experienced fitters and turners for the high-grade 
mechanical branches, the follow-up of occupation 1n а less- 
skilled mechanical branch is likely to show plumbers and 
sheet metal workers as superior to fitters and turners. 
Another example may be drawn from communications, 
where it was found on more than one Occasion that men 
claiming mechanical interests were actually poorer than 
average as signallers or telegraphists. This may well have 


* Preferabl. i ion, discrimi function, or multiple contin- 
ly multiple correlation, discriminant iun " t 2 
gency methods should be used in order to allow for the overlapping of items 
With one another and with test scores. 
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been due to the drafting of more responsible and more able 
mechanically-minded recruits to other branches. Theoreti- 
cally it might be possible to correct for such selectivity, but 
no simple statistical technique is as yet available for analys- 
ing selectivity in a large number of yes-no items. 

(6) Items other than two-response ones frequently show non- 
linear relationships to proficiency, so that ordinary correla- 
tion techniques are inappropriate. For example, neither 
recruits possessing 2-4 words/min. morse experience, nor 
those with 1-6 months’ driving experience, were found 
superior to men with no morse or driving experience, but 
greater degrees of experience did give an advantage. Among 
naval officer candidates those leaving school at 16-17 were 
superior to those leaving at 14—15, but those educated to 
18-20 showed no further rise. Age tends to show very 
irregular effects. In some branches 20-25 year olds were 
superior both to 18-19 year and to older men. In communi- 
cations age below 20 made for more rapid morse learning, 
but thereafter there was apparently no big drop till after 
30 or 35 years. In Army motor drivers age up to 25 was 
irrelevant, but then (at least up to 40) the number of hours of 
training needed was roughly identical with years of age- 
The significance of such variables must, therefore, be 
explored by techniques like chi-squared or correlation ratio; 
Which makes it difficult to combine them, or to determine 
their overlapping, with other items. 

J In spite of these difficulties a number of very substantial associa- 
tions were discovered between questionnaire items and proficiency- 
In the Navy it was particularly noticeable that leadership experi- 
ence was advantageous, not only among seamen and officer can- 
didates, but in diverse employments such as air fitters, telegra- 
phists, radio mechanics and safety equipment ratings. Scouts, 
members of Boys' Brigade, and the like, and of organisations such 
аз the A.T.C. and Sea Cadets showed similar superiority, though 
the significance of this finding is—as pointed out—dubious. 

( Table X shows some specimen correlations between occupa- 
tional experience and proficiency in the A.T.S., also between the 
best weighted battery of selection tests and proficiency (uncor- 
rected for selectivity). Although so troublesome to assess, occupa- 
tional success is clearly more predictive than test ability in several 
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jobs. This is particularly true in war-time when training has to be 


as short as possible. 


` 


TABLE X.—Previous OCCUPATION, Tests RESULT AND PROFICIENCY IN THE 
A.T.S 


ivili 7 Correlati 
Civilian Experience A.T.S. Employment with Proficiency 
Experi- Test 
ence Battery 

Clerical General | Overall . a 

Duties proficiency . +296 673 
\ Clerks | Stenographic . 

Cleri proficiency . 671 "37 
lerical Teleprinter Operators 487 *523 
Qomestic work... | Соо Rc. а 204 -390 
Wwitchboard operating . | Switchboard Operators . -533 485 
Motor driving Drivers s 510 -375 


The collected results on driving experience from seven repre- 
sentative driver training regiments in R.A. or infantry are shown 
in Table XI. The final driving proficiency marks or grades are 
classified into good (the best 30 per cent.), medium (42 per cent.), 
and poor (the bottom 28 per cent., including failures). Apparently 
amount of experience beyond seven months has little influence. If 
a tetrachoric correlation is calculated the coefficient of :56 is 
similar to the A.T.S. result, and exceeds the multiple correlation 
of -42 obtained between the standard test battery and proficiency. 
Since experience showed negligible (positive) correlations with 
intelligence, a combination of experience and test scores should 
theoretically yield much improved predictions. Actually, owing to 
the shortage, experienced drivers were accepted regardless of 
intelligence levels, and minimum test standards applied only to 
the inexperienced. 


Tapie XL Errecrs or Previous Ехтнывчсв ох Driven SCREEN! — 


Driving Experi Р 
ig Experience before Recruitment N ae SINUS. 
per cent. per cent. | per cent. 
Professional бы, Diet .. e| 76 48 44 8 
ateur for more than 2 years E. 55 
6 pu to2 years .. i is 58 
ont] Я 
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Ho 411 
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Two rather extensive naval investigations are worth summarising 
in some detail. Among 1,000 seamen and stokers undergoing 
elementary training, instructors picked out the best three and 
worst three from their own classes, yielding 136 *'bests and 136 
“worsts” in all. S.P. tests 1—4 all gave very similar correlations, the 
multiple coefficient being -444 and the correlation of T2 -433. 
Items are listed below in order of their statistical significance (T2 
being held constant by analysis of covariance). 

Age. 35 per cent. of men over 30 and only 9 per cent. of men 
under 20 were bests; the relationship was approximately 
linear. 

Occupation. Retail tradesmen and drivers v. the rest. 

Leadership experience. 

Membership of Scouts or other boys’ organisations. 

Labouring occupations v. the rest. Here the association was 
negative. 

“Social” interests. Ticking the items playing music, concert 
party work, acting, motor driving. 

Clerical occupations v. the rest. This showed the highest direct 
relationship to proficiency, but it was mostly attributable to 
high intelligence. 


Service experience, in Home Guard, J.T.C., A.T.C., Civil 
Defence, etc. 


Education beyond 14, or—negatively—left school at 14 in a class 
below the top. 


Athletic interests. Ticking the items football, boxing, athletics, 

swimming or camping. 

Domestic interests. Tickin 

gardening, cooking. 

Emotional instability, a “Р” rating gave a negative association. 

Only 43 men were so marked, but 23 per cent. of these were 
worsts as contrasted with 13 per cent. of non-‘“Ps.” This 
item, together with mechanical interests and job experience 
were not significant statistically. 

The relationship with Proficiency and the inter-itemi relations 
were expressed as correlations. Multiple correlation showed that 
the most predictive items in combination were T2, age, occu- 
pation, leadership, scouting, Service experience, emotional insta- 
bility, domestic interests, and education. A rough scoring scheme 


for items other than T2 was made out. When the questionnaires 


g the items reading, house repairs, 
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we ndis é 

yer > this waya biserial correlation of +461 was obtained, 
533. No Mer E score combined, the correlation was 
fresh ERE ity occurred for verifying these figures on a 
M uh Wo o n is drawn from a very high-grade naval 
difficult ae pa anics, for which selection was particularly 
өз ОС dene * its extreme specialisation. Groups of 850, 500 
qnot e ollowed up at different times and the correlations of 
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D ed a Wie on test gave correlations of -332 and -457. 
B ers: Bret ү н men passing their training were very high test 
vali diy ER ы у по means all high scorers passed; also that the 
M ire the E es varied significantly in the different centres 
кни Es were trained. These observations suggest 
eee ple 24 willingness to work at so long and complex a 
ane ү NUES part. These are factors which may vary 
the cc im np erent centres, but which may also depend on 
Pens ath о шарды students. Hence it might be possible 
я iara y means of a questionnaire. An educational and 
ES tea Aen Nb; given to the group of 500, and the 
ONSE СЫНЫ: re found to be significantly associated with good 
cos on at school after the age of 15. 
us ance at continuation or evening 
а ag banking, insurance and accoun! 
e best, presumably because they showe' 

hair of purpose"). 
Obs passed any examination. 
nasi credit or distinction in 
n v. passing or merely reaching 


classes (not necessarily 
tancy students Were 
d greatest “serious- 


School Certificate or Matricu- 
“School Certificate 


standard." 

5 i dq, 15. t э 
chool Certificate or Matriculation їп mathematics and/or 
physics. 

se repairs, radio repairs, 
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electrical repairs, or photography. 
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a correlation of -42. Again, 56 per cent. and 29 per cent. respec- 
tively checked two or more of these interests, corresponding to a 
correlation of -22. These two factors and T2 were almost inde- 
pendent, and a scoring scheme based on all three gave the very 
promising multiple correlation of -56. 


CHAPTER 1Х 
THE INTERVIEW AND ITS VALIDITY 


Abstract. —Hints are given on the conduct of employment-or 
diagnostic interviews, with special reference to the contributions 
of Oldfield, Wilson and others. А number of the points are listed 
into which Service psychologists, psychiatrists and P.S.O.s 
enquired when attempting to assess the attitudes and emotional 
stability of recruits. Previous research shows considerable incon- 
sistency between the judgments of different interviewers of the 
same man, and suggests that such judgments are often so biased as 
to have very poor validity. While it is admitted that the personality 
аз а whole can only be summed up by the “clinical”? approach of 
an interviewer, there is evidence that the “psychometric” approach 
—which measures partial facets of the person—may actually give 
better predictions of occupational suitability. E 
Investigations in the Services tended to confirm the unreliability 
of Interviewing, and to show that test results alone have better 
validity than the average P.S.O. interviewer. But there were large 
individual differences in the skill of different P.S.O's, and the. 
Value of interview judgments by qualified psychologists. and 
Psychiatrists was generally high. It is pointed out that, in spite of 
its defects, interviewing is often more economical than an elaborate 
Psychometric programme, and more acceptable to the persons 
Undergoing classification and to their employers. 
——— „шл с=з m 

Тһе technique of interviewing, and the value of conclusions 
based on interviews, have been widely studied by psychologists. 
Useful references include Anderson (1929), Bingham and Moore 
(1931), Symonds (1931), Strang (1937), Rodger; Nadel and Brown 
(1939). We shall summarise some of the main points briefly, an d 
then consider in more detail recent contributions by British 

Writers, 1 
(1) The large variety of types of interview, or of aims of inter- 
viewing, should be recognised, since they require different 
approaches and techniques; for example, the diagnostic 
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(2) 


(3) 
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interview (vocational or psychiatric), the treatment inter- 
view (psychotherapeutic or the giving of vocational or 
educational advice), the research interview for collecting 
information (the social worker’s, anthropologist’s, public 
opinion surveyor's, and others). Interviewing by a board is 


generally considered more artificial and more disturbing 


than the ordinary two-person interview. 

Though mechanical rules cannot be laid down, the tech- 
nique of interviewing can be formulated and greatly 
improved by training. Interviewers should also be carefully 
selected. : 
The main qualities of a good interviewer, and the main 
factors leading to good rapport, are thorough knowledge of 
the jobs or other matters with which the interviewee is con- 
cerned and of topics in which he is interested, emotional 
maturity or a well-adjusted personality such that the inter- 
viewer is not shocked by anything the interviewee says, 
reputation among previous interviewees for sincerity, sym- 
pathy and sensitiveness. Good health and freedom from 
fatigue or strain are also valuable. 


(4) While many writers urge greater standardisation of pro- 


(5) 


cedure, and while an oral questionnaire may be adequate for 
some purposes, e.g. public opinion research, most psycho- 
logists, in this country at least, insist on flexibility. They 
may have a scheme of points to cover, or hold a general plan 
in mind, but they Prefer not to keep to any fixed order О 

topics since this makes. for awkward switches. , The 
Interview is not an experimental situation; it is more effec- 
tively controlled and standardised by aiming at good rapport 
than by using Stereotyped questions. 

All available information should be ready, and should be 
critically studied, beforehand; for example, test scores an 


material which can appropriately be collected by question- 
naire, : 


sion of Plenty of time to Spare. 
Opening enquiries should be factual or otherwise designed 


РОГ CCERBSREKÁKHMm 


чә” 


(8) 


(9) 


(10) 


(11) 


(12) 


(13) 
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шыш ерше nee to arouse shyness, guilt or 
us шс к 3 at y be approached indirectly. But 
маиы is erview must be adapted to the inter- 
y and may be very different with different 
people. 
In general the interviewee should do most of the talking; 
ша I should be provision for spontaneity and initia- 
part. Information obtained by interrogation 1$ 
known to be less reliable than spontaneous narrative. 
A useful way of getting rational and impersonal information 
on traits and interests is to invite comparisons: “Do you 
v doing X to Y ? Are you like your brother (or sister) in 
The taking of notes depends 
topic under discussion. Notes s 
It is of great importance to have г 
to back up any conclusions reac 
made, and not to trust to memory. Info 
observations made should be carefully 
Interpretations. х 
There is ample evidence that any prejudice on the part of 
the interviewer may unwittingly affect the information he 
collects, or judgments he makes. In diagnostic interviewing 
he should particularly try to avoid being biased by one or 
two outstanding merits or defects in the interviewee, and by 
his first impression of him, or by chance resemblance to 
acquaintances, thus neglecting many of the complexities of 
personality. 
The belief in external signs of personal traits (particular 
features, type of build, gestures speech, dress, etc.) is 
highly delusive. Manner and expressive movements may, 
indeed, be significant, but must be intetpreted with caution 
since they may arise from different causes in different people, 
and they are largely dependent both on transient moods, e.g. 
temporary nervousness, and on cultural conventions. There 
is probably no relation whatever between any static facial or 
bodily characteristic and any vocational skill (apart from 
crippling abnormalities). 
In the diagnostic interview it is generally useful for the inter- 
viewer to express his main conclusions in the form of ratings 


on the interviewee and the 
hould be legible and concise. 
ecords of factual evidence 
hed or recommendations 
rmation received and 
distinguished, from 
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on a small number of important and more or less inde- 
pendent personality traits. Such quantification helps to pro- 
duce a thoughtful analysis of the evidence, and is valuable 
in follow-up. The ratings may be supplemented by a pen- 
picture of the person as a whole. 
(14) When an interview involves guidance, the interviewer 
- should sum up the decisions reached at the close, and should 
ensure that the interviewee is set to take some appropriate 
action. 


Recent Discussions of Interviewing 


Among publications since 1939, Oldfield's book (1941) is out- 
standing for its analysis of the steps by which interviewers reach 
their conclusions. In his view the aim of the employment, and 
many other types of, interview is not to collect factual information 
nor to make judgments from this information or other clues, but 
to stimulate the interviewee to display his "attitudes," and to 
observe these attitudes. By attitudes he means consistent ways of 
reacting to situations or topics, or consistent types of behaviour, 
Which are more specific and more overt than the underlying 
personality traits. Such attitudes are partly inferred from expres- 
sive movements (manner, speech, etc.) and partly from information 
about past history, but chiefly by raising various topics and seeing 
how he reacts, by observing the effects on him of sympathy, 
humour, agreement, disagreement, interest, surprise, etc. We thus 
get "the growth and continuously increased articulation of a general 
schematic picture of his (the interviewee’s) personality in the inter- 
viewer's mind.” This schema or working model can then be 
imagined as fitting, or not fitting, into the prospective job. Gener- 
ally, however, it is implicit or unformulated and the psychologically 
untrained interviewer has considerable difficulty in putting it into 
words. He tends to subsume it under one or another of a set of 
stock types or frameworks which inevitably distort the complex 
personality structure. 

Wilson (1945) has described th 
viewing candidates for high- 
officer cadets, skilled mecha 


e methods he employs in inter- 
grade employment or training, such as 
: 1 nics and engineers. His main theme is 
that an interview which can last only about twenty minutes should 
be a strenuous interplay between the two persons concerned, not 
just a pleasant conversation leading to a number of intuitions. No 
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questions should be so superficial that they can be answered merely 
by yes or no. The interviewer should consciously control and vary 
the pace” of the interview. He should possess a clear idea of his 
objectives and should try to collect evidence under a set of headings 
such as those listed below, in terms of which Oldfield’s schema may 
be formulated and the suitability of different candidates compared. 
These do not constitute topics which are taken up seriatim; rather 
the course of an interview consists of an oral autobiography, 
prodded by the interviewer, and any items may contribute data 
under several headings. 

(1) Effective Intelligence —What use has the candidate made of 
the ability shown by his test scores; what are the quality, 
pace and efficiency of his thinking and self-expression ? Can 
he analyse, abstract and generalise about novel and complex 
topics, or is his thinking merely reproductive, vague Ог 
muddled? Does he show intellectual assurance, clarity and 
fluency ? 
Technical Prowess.—]ust as effective intelligence differs 
from test scores, so here the interviewer tries to go beyond 
the candidate’s academic or industrial record. What use can 
he make of his book-work ? Does he understand basic ideas 
and can he apply them to new practical problems resource- 
fully? Is his technical capacity confident, dilettante, original, 


or restricted ? È 

(3) Interests —Their number, range and depth, their stability 
or changeability, whether integrated or conflicting, and 
whether relevant to the proposed employment. The inter- 
viewer should conjure up the atmosphere of the job to see 
whether it may be distasteful. How did the main interests 
arise; are they autonomous or based on some temporary 
identification ? 

(4) Motivation.—Soundness of reason: € 
(or a commission), and strength of drives, for undergoing 
long and tiresome training. Was his motivation steady in the 
past, showing responsibility, pride in achievement, desire to 
complete any task undertaken ? Does he work to inner 
standards rather than outer demands ? Has he a useful degree 
of compulsive or obsessional tendency? Does he possess 2 
detailed and realistic knowledge of the proposed job or train- 
ing course, and has he taken the necessary preliminary steps? 


(2 


> 


s for desiring employment 


П 
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With this are linked his energy or forcefulness and his 
doggedness or persistence in the past, viewed in the light of 
his vocational opportunities. 

(5) Dominance or competitiveness, i.e. his managerial and 
leadership potentialities. 


(6) Acceptability to people he will work with, under or over, 
including social and athletic accomplishments, freedom from 
annoying habits, conventionality of outlook, liking for com- 
pany. This is of importance even among such persons as 
skilled mechanics whose work appears to involve few social 
contacts. 

(7) Personal Attitudes.—Whether co-operative or individualistic, 
irritable or impatient. Who is he influenced by or dependent 
on, e.g. what are his relations to his parents? What are his 
reactions to the Service and authority, also to exerting 
authority ? 

, Numerous hints or clues are listed by Wilson which may throw 
light on these headings, or which should be distrusted. Most reli- 
ance should be placed on the candidate's past record (as amplified 
orally) and on his test results. One feature of Wilson's system that 
appears a little dangerous is his advocacy of informal achievement, 
aptitude and personality tests during the interview; for example, 
setting problems to bring out mental or technical ingenuity and 
pertinacity, observing whether he “crumbles” when the “расе” of 
the interview is too great, or whether his “dominance” comes out 
in his attitude to the interviewer. There is so much evidence of 
the lack of validity, and of divergences in interpreting, such uncon- 
trolled tests that interviewers should not generally be encouraged 
to use them. Valuable factors, which he also stresses, are the 
possession by the interviewer of sufficient information about any 
and every technical, cultural and athletic field to be able to probe 
the thoroughness of the candidate's relevant interests, of adequate 
psychological knowledge to recognise abnormal motivations, and 
of familiarity with the “mythology” which candidates such as 
recruits hold regarding the advantages and disadvantages of 
various employments. Finally, while Wilson prefers separate 
interviews by the psychologist, the medical officer and the technical 
examiner, etc., he considers that candidates should not be 
accepted or rejected on the basis of any one of them, but should be 


THE INTERVIEW AND ITS VALIDITY 149 


P a at a conference where the findings can be integrated, and 
dn be considered from all angles. $ К 
aoe by Misselbrook (1946) gives а useful picture of the 
fifteen du as it were, a lower level, that is the P.S.O.s' ten to 
d eise is interview with candidates for the great bulk of 
E rd employments where the effective use of the available time 
ie e utmost importance. He points out the value of the pre- 
(the à ry talk to each batch of recruits, of the information charts 
ҮТ de to which a recruit has studied these gives a useful index 
la ee of his attitude to an employment), and particu- 
ae n е questionnaires as checked up by the recruiting assis- 
e ac the interviewer is practised in "sight reading" such 
n а һе ог she can adapt a very flexible approach, since 
уз e з the essential points аге already covered. Among other aids 
if d arassed interviewer are sets of oral trade questions which, 
E own by heart, can be applied informally: photographs which 
do pk the conditions of work in prospective jobs and often 
ae ate comments that reveal the candidate's attitude; and 
En. ps pieces which help to put the candidate ateasesince he can 
En about familiar, concrete things, and which bring outknowledge 
methods of production, of standards of workmanship and so on. 
m eda research on the Admiralty questionnaire, together 
3 suggestions put forward by Wilson and Misselbrook, enable 
s to list a number of easily recognisable *contra-indications. 


When several of these bad points are present in the same can- 
Servi for success at any responsible 
€rvice employment. Validatory investigations have yielded such 
uniform results in different jobs that we are entitled to regard 
E peel as a general trait. i 
e rule for à man who is a failure in one kin 
арша kind to which he settles down and manages really success- 
ully. The list includes: 
(1) Unsteady, unprogressive ОГ retrogressive work record; 
periods of unemployment considered in relation to district, 
also to ability. Thus, long-term employment ata low-grade 
job would be commendable in a very dull, but not in an 


intelligent, candidate. ; 
of his own job. 


(2) Inability to give an intelligible account 
(3) Post-war ambition. below the level expected from pre-war 


occupation and test scores. 
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(4) Failure to reach the usual form for his age and intelligence 

at school. 
` (8) No further education in connection with occupation, or 
otherwise, since 14. 

(6) Dislike at school for mathematical and science subjects. 
Handwork, athletics, geography or “none” listed as the 
school subjects liked best. 

(7) Interests limited to the purely social and display type, e.g. 
Jazz playing, the purely athletic, the entirely solitary, or too 
restless and migratory. 

(8) No positive indications of leadership and responsibility. 

(9) Evidence of marked pre-occupation with own health. 

Overlapping with this list is the series of questions, drawn up by 

naval neuropsychiatrists, for application by Wrens at recruiting 
centres, in order to bring to light potential cases of psychiatric 
breakdown. The questions were not asked in a stereotyped fashion 
but were introduced, often indirectly, as topics for discussion. 

(1) Lack of any sustained interests or hobbies, and avoidance of 
social or of physically dangerous interests, 

(2) hs what illnesses 'have you been off work during the last 

ve years ?”” 

(3) “Наз your health been good? Have you had to go to your 
doctor a certain amount (or often) 2” The aim is to find how 
far minor ailments have been allowed to interfere with 
normal activities. 

(4) Танир dieting. “Ро you have to be careful in what you 
eat? 

(5) “How well do you sleep?” 

(6) ee about your powers of endurance? Do you easily get 
шеа?” 2 

(7) “Would you regard yourself as sensitive or highly strung? 
Have you ever had any trouble with your nerves ?” 

(8) Incoherence of employment record. rs 

А recruit whose illnesses interfere disproportionately with his 

career, or who drifts from one job to another for no good reason, 
or who shows others of these symptoms, was marked P by the 
Wren on his questionnaire—that is positive signs of instability. 
Interviews by Army P.S.O.s had to keep more closely to the 
qualification form, since this had not been previously checked by 
recruiting assistants as had the naval questionnaire, also because it 
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was desiral ; : 
en она. their desire to employ hunches in prefer- 
up their conclusi idence. They were, however, instructed to sum 
q ties ions in the form of rough ratings for the following 
I) CT- 
the due temperament). A low rating was given if 
ue moo ү Жаке about going into action” with 
ы Да T Y mode of life was unusually sheltered or 
^ Qe iade if he showed undue avoidance of vigorous and 
E o Мр and fear of serving abroad. (Such 
vente the erred to the psychiatrist.) A high rating was 
() ER. (em d adventurous and aggressive. 
sie cit record). A high rating was given if 
шне) Pa in the light of aptitudes and oppor- 
I ir: een progressive, coherent and continuous; à 
(3) L. (leader if the opposite. 
eu D. This was based on good, average or poor 
Ree pase tien of supervising and directing others in his work or 
(4) OR activities. 
. (officer recommendation) and/or potential N.C.O. 


ne recorded. 
informátion i i 
mation is available as to the employment interviews 


conduc! 2 
азары Py peche either with the low-grades, Army officer 
to be Sons d neurotics. However, Gillespie issued a list of factors 
Viewers in gred ; by R.A.F. medical and psychiatric inter- 
to some Ret T neurotic predisposition. These overlap 
€nquired: nt with the points into which Service psychologists 
(1) S 
e. CA: such as stammering, persiste 
(3) sene fears—of the dark, loneliness, 
shown UE record at school and wor! 
4) Com Ma requent changes. XA 
beet lack of interest in games involving bodily risks 
(5) ШО diving, climbing. — 
(6) Res d reasons for joining R.A.F. or air crew. 
(7) Visce A to sight of blood, e.g fainting. : 
on ral responses before some ordeal such as аһ examina- 
(8) ei ea E match or crucial interview- 
caer ackground, history of mental and 
(9) Reactia home.” 
eactions to fighting at school. 


ntenuresis, insomnia, 


closed spaces, etc. 
k, the latter often 


nervous illness, 
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(10) Food, alcohol and tobacco habits excessively temperate or 
intemperate, or hypochondriacal. 

(11) Temperament: Sociable or unsociable; obsessional traits; 
hysterical (dramatic, immature, suggestible, dodging); 
psychopathic (lack of persistence, extreme irritability, gener- 
ally poor personality); depression; anxious; narcissistic 
(conceited, flamboyant, self-centred), 

(12) Persistent air-sickness, headaches, etc., during training. 


Reliability and Validity of Interview Judgments 

Investigations of the employment interview described by Holl- 
ingworth (1929), Hartog and Rhodes (1935) and others suggest 
that there is so much divergence between the views of different 
interviewers of the same candidates, that the technique is prac- 
tically worthless except for such restricted purposes as a business 
man choosing his own secretary. Several experiments, however, 
have shown much the same reliability for interview judgments as 


Fearing (1942) studied ratings of police officer candidates by 
boards of four judges and found correlations of -23 to -48 on ten 
separate traits. But for the total grading based on all the traits the 
mean agreement was ‘59. Several investigations of public opinion 
surveys show high, but far from perfect, agreement when a second 
interviewer asks the same questions shortly after the first one (cf. 
Cantril, 1944; McNemar, 1946). On straightforward, concrete 
questions such as ownership of a motor-car or telephone there was 
approximately 90 per cent. agreement, and on ratings of age 71 per 
cent. of identical judgments, corresponding to a correlation of ‘91, 
But with ratings of more complex qualities such as economic 
status, only 54 per cent. of judgments were identical—a correlation 
of :63. Perhaps the most striking study was that of № ewman, 
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Bobbitt and Cameron (1946), who interviewed 536 U.S. coast- 
iem ad candidates and tried to assess the very complex 
ET © ability to pass training and to make a good officer in 
inde » E. psychiatrist's judgments correlated 81 and -86 with 
sug, NES judgments by two psychologists This 
de E that psychologically-trained interviewers who have 
un ‚а clear and agreed conception of what they are looking for 
I achieve very satisfactory reliability. 
a E tigations of validity are not very favourable. Webb obtained 
ue СА of -63 between his interviewers’ judgments and judg- ` 
Md teachers who knew the boys, but Magson only found a 
12 bet ion of -18 between interviewers and fellow students, and 
theco ween interviewers and results of intelligence tests. Possibly 
Clark nception of intelligence among adults is particularly equivocal. 
ege (1926) showed that two interviewers could estimate the col- 
t pedes of students with validities of 66 and -73.- But МЕ 
from чен are little if any better than might be obtaine 
Odes attery of aptitude and achievement tests, and from previous 
a" The data reported by Bobbitt and Newman (1944) ш 
BE азе yield tetrachoric correlations of -49 between combine 
est iew judgments and passing the officer cadet training s 
i Dat alone (which were known to the interviewers) a " 
ce value of 47. The combination of interview judgments 3 
a nu gave the best validity, namely, +56. Anderson (1929) er 
a ps mber of tables showing the relevance of qualities aie 
eae зан ic interview to success at salesmanship and other ae 
UM appointments, and claims that routine sec E o 
OE omitted this interview were less diagnostic. Sar m Oe 
predic other hand, quotes studies by Wittman and himse 2 и 
eae onb derived from batteries of tests OF other Бала. ү 
one e Ga to clinical judgments based on interview: m 
esearch on the prognosis of schizophrenic patients, an app 


ri © |; > 
Priate combination of items from a rating scale Ой present gm 
utcome than 


t 
redi OM better correlations with subsequent у of college 
C n . 2 1 |7. i 
lons by psychiatrists. Similarly, ш 4 quation had been 


S p 

e two tests (whose multiple regression G н 

view shed in a previous group) correlated more 115 d indicate 

that Judgments with academic achievement. This np n 

si Suggestion, halo and bias, fallacious inferences 20 al inter- 
Ens and the like, play so large a part even in profession 
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viewing that the inclusion of an interview may distort rather than 
improve vocational procedures. Conrad (1947) reaches the same 
conclusion. 

The controversy between the “clinical” and “psychometric” ` 
approaches to the study of personality is a perennial one. Allport 
(1942) contrasts the “nomothetic” and "idiographic" viewpoints 
in psychology. He claims that the objective and analytic methods 
of the former disrupt the integrated structure of the personality, 
and that the more intuitive idiographic study of the individual 
person as an organised whole makes for better "understanding, 
prediction and control." Hence the latter approach is no less 
scientific than nomothetism. Vernon (1935) describes experiments 
by von Bracken, Cantril and himself showing “that personality can 
be more accurately and consistently judged as a structured whole 
than can the separate traits or components into which it may be 
analysed." Similarly Polansky’s (1941) investigation proves that 
better understanding of people and prediction of their actions may 
be derived from “structural” case studies than from lists of test 
scores. These results do not, however, affect the present issue— 
namely, that clinical judgments of the personality as a whole, based 
on interview, may be so unreliable or so distorted that better pre- 
dictions of suitability for a particular vocation may be obtained 
from properly weighted combinations of psychometric data. Yet 
another aspect of the same controversy occurs in public opinion 
research. It seems very plausible that the free interview and obser- 
vation techniques used by Mass Observation providea better under- 
standing of people’s attitudes than do answers to a stereotyped list 
of questions, but it has yet to be proved that subjective factors do 
not play an unduly large part in Mass Observation’s collection and 
interpretation of its evidence (cf. Durant and Harrisson, 1942). 

As long ago as 1925 Viteles argued that an adequate picture of 
occupational suitability cannot be obtained from test scores or 
objective items of past history, and that the clinical approach of the 
trained psychologist is needed to take account of the other non- 
measurable factors and to integrate them into an over-all view of _ 
the candidate. Freyd (1925), however, maintained that the relevance 
of any such additional factors should be systematically investigated 
and validated in the same way as that of test scores. Wallin (1941) 
and Sarbin (1944) again state that the interviewer is really doing 
the same thing as the psychometrist—that is, combining the indi- 
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cc рив items which he believes, on the basis of past 
into a ates ee cases, to be predictive of success or failure, 
ally and es gment of suitability; but that he is doing it inform- 
Clearl Bm and, therefore, less effectively. ; 
ey as y i Y incumbent on psychologists who regard the inter- 
E valid instrument of selection and guidance to produce 
accord e evidence. Members of the N.LLP., whose views 
their dicen those of Viteles, have indeed proved the validity of 
mM E guidance which largely involves interviewing, but 
without i emonstrated. that they could not predict as effectively 
is ie Yet another important query that we must try to answer 
is рено. an interview by а trained psychologist or psychiatrist 
enced lor to one by an ordinary industrial employer or an experi- 
ed officer in the Forces. 


Reliability of Service Interviews 


No data are available comparing two independent interviews by 


Che hologists or P.S.O.s except those listed in the previous 
fie (Table VI), but there is evidence of considérable T 
icd m standards of judgment. ‘Thus, the overall proportion о 
who Tecruits marked “P” (unstable) by Wren recruiting assistants, 
the Ne uewed some 80,000 men in 1943, was 2-69 per cent. But 
1 $00 nge of 101 different Wrens (each of whom interviewed 300 to 
05 men) was from 0 to 12-9 per cent., and the quartile range 
er Per cent, to 3-5 per cent. Each Wren had a fairly steady rate 0 
hal own, the correlation between the “Ps” in the first and eU 

Ба the period being "79. It was also found that Wrens judge 
Y their supervisors to be high on such traits as carefulness, steadi- 


nes ELIS 

thel conscientious, reliable, experienced, and teachable, mix 

216 largest numbers. The rate varied considerably with area—tha' 
d a common 


is, 
БОГУ the Wrens working in а given area һа 
um but there were significant individual differences over 
S above this. 
E results should not be regarded as а $6 
NS since, owi: Б :cability 0 
tric j nce, owing to the impractical ility of ing Ps 
imeem of “P” men, little stress was Jaid on this аш i 
ful ‘S. Royal Arthur, P.S.O.s were encouraged to refer al : ji 
yea to the psychiatrist, and among nine of them, each o К pa 
tan rviewed 500 to 1,200 recruits during 1944, the referra ur 
Bed from 2-8 per cent. to 97 per cent., the total rate for 0 


rious criticism of the 


{ arranging psychia- 
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8,000 recruits being 6-3 per cent. The psychiatrist classified these 
referrals as follows : : 
5-7 per cent. were considered as suitable only for immediate 
discharge on psychiatric grounds. 
17.8 per cent. were judged likely to give six months’ service or 
less before breaking down or showing criminal tendencies. 
49-5 per cent. were borderline, being likely to give at least a 
year's service; they were, however, definitely thought worth 
referring. 
21-0 per cent. were found to possess no psychiatric abnormality. 
These figures suggest fairly close agreement between the P.S.O.s' 
and psychiatrist’s judgments of instability, though in the absence 
of a control group of men not referred, no exact correlation between 
them can be calculated. The numbers in the first two groups 
referred by different P.S.O.s also varied from 11 per cent. to 37 per 
cent., and it was noted that, on the whole, P.S.O.s who sent the 
biggest total numbers also picked the biggest proportions of these 
serious cases. In the Army, P.S.O.s referred 14 per cent. of recruits 
to psychiatrists in a representative period, but their variability was 
not checked. Large variations were found, however, in the pro- 
portions of officer recommendations made by different 
P.S.0.s 
Evidence of the reliability of psychiatric interviews in the R.A.F. 
is given by Hill and Williams (1947) in a study of 541 cases who 
were seen by more than one of thirty-seven psychiatrists at 
different centres, at intervals ranging from several days to a few 
months. Among the items noted by the psychiatrists were diagnosis 
or “reaction type" under one of six headings, and degree of pre- 
disposition to neurosis—severe, mild or none. Of the diagnoses 
81 per cent. were in exact agreement, but the figure varied con- 
siderably in different reaction types. If the average agreement in 
four main types is taken—anxiety, depression, hysteria, and others 
—it drops to 685 per cent., corresponding to a reliability coefficient 
of -76*. The reliability of judgments of neurotic predisposition was 
somewhat inadequate, corresponding to a tetrachoric correlation of 
-58. Nearly half the men assessed as “severe” by one psychiatrist 
were called “mild,” or occasionally “nil” by the other. Further 
judgments, such as degree of flying or other stress, showed inter- 
mediate degrees of agreement. 
* This was calculated by Burt's (1945) matching formula. 
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Validity of Recruiting Assistants’ and P.S.O.s' Interviews 
А The ability of Wren recruiting assistants, using the questions 
listed above, to pick psychiatric cases was studied in an experiment 
where four Wrens interviewed 147 actual psychiatric in- or out- 
patients mixed with 175 controls (Curran and Roberts, 1945). On 
the basis of their enquiries into pre-Service history only they 
picked 58 per cent. of the cases and 16 per cent. of the controls as ~ 
suspect. But it was later discovered that of the 16 per cent. “false 
positives,” most of whom were seen by a psychiatrist, over half did 
actually possess some considerable degree of abnormality. The 
Wrens missed a rather high proportion of the affective psychotics 
and neurotics, but managed to pick up twenty of the twenty-four 
hysterical patients. Obviously this experiment could not prove that 
the Wrens would have been equally successful in diagnosing 
breakdown before it occurred. Another weakness was that the 
four Wrens, though not necessarily the best available, were known 
to be generally good at their work, arid others might not have done 
so well. It should also be noted that their success was not any 
greater than that of the various personality inventories which 
have been tried out as screening tests (cf. Chapter XV). 

Variations in success of interviewing were shown when 1,631 
naval radio and electrical mechanics and wiremen, who had been 
selected by thirteen different P.S.O.s, were followed up. The 
failure rates during their training averaged 24 per cent., but ranged 
from 18 per cent. among men selected by the most successful 
P.S.O. to 37 per cent. for men selected by the least successful. This 
study, however, is incomplete, since we do not know how many 
potentially suitable men were missed. Conceivably the P.S.0. of 
whose selectees only 18 per cent. failed was more choosy” than 
the rest, and had she accepted larger proportions of her inter- 
viewees her failure rate might have risen. In an analogous investi- 
gation in the Army, nearly 3,000 W.O.S.B. candidates were 
“followed back” to the P.S.O.s (or other officers) who had origin- 
ally recommended them. Although the numbers interviewed by 
any one P.S.O. were rather small, it was possible to show that there 
Were significant variations among P.S.0.s both in the proportions 
of suitable recruits whom they recommended, and in the failure 
rates of their recommends at the W.O.S.B.s; also that the more 
“choosy” P.S.O.s did not necessarily pick better candidates. 

In seven investigations it was possible to correlate both test 
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results and P.S.O. judgments of suitability, based on interview, 
questionnaire and tests, with subsequent training course results or 
proficiency grades, and in only one of these did the judgments 
show better validity than some of the tests. Thus, among 730 Army 
personnel trained as drivers the correlation between P.S.O.s' 
assessments and pass or fail was -234, while the validities of the 
Bennett test (S.P.2) and age (youthfulness) alone were -294 and -288. 
Gradings by a P.S.O. with engineering experience were available 

‚ for 1,014 boys starting training as Army tradesmen; compared with 
practical marks over the next 1-3 years their validity was :272, 
those of the Bennett and Squares (S.P.4) tests being -294 and -288. 
However, the gradings were not intended to predict general suit- 
ability for trade training so much as to advise the training schools 
on the best type of course for each boy, hence this comparison may 
not be entirely legitimate. 

The final course results of 411 naval radar plot operators were 
collected (cf. p. 244), and the correlation of these with R.S.O.s’ 
assessments was 374. The correlations for T2, a scale reading test 
and a graph reading test were “416, -430 and -400 respectively. If 
'T2 is held constant, the partial correlations with the criterion, i.e. 
the extent to which the P.S.O.s or the other tests can improve on” 
T2 are -137 for the P.S.O., -172 and -193 for scale and graph 
reading. It should be noted that in this instance the P.S.O.s had 
rejected a (not very large) proportion of recruits as unsuitable for 
training; hence those that did 8o forward were more highly 
selected on P.S.O. judgments than on T2, also more selected on 
T2 than on the other two tests. Probably if allowance could be 
made for this selectivity, T2, P.S.O. and scale reading would all 
show almost the same validities. 

Table XX XIX (p. 246) lists the correlations of P.S.O. judgments 
of 601 radio mechanics and 342 electrical mechanics with success 
or failure on course, together with the validities of several S.P. 
tests among some 300 and 200 of these men. Again the P.S.O.s tend 
to be surpassed by T2 and by information tests which are directly 
relevant to the jobs. But here, too, selectivity imposes a greater 
handicap on the P.S.O.s than on the tests. Similar investigations: 
in the U.S. Navy yielding equally unfavourable results are 
described by Stuit (1947). ; 

The most successful judgments were made by one highly experi- 
enced P.S.O. She graded 460 men who had been sent on course as 
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safety equipment ratings, without seeing either the men or their 
test scores, but only from their questionnaires. It should also be 
noted that the relevance of particular questionnaire items had not 
been studied in any previous follow-up investigation. Neverthe- 
less, the correlation between her judgments and subsequent course 
marks was -554, whereas T2 alone gave a correlation of :393 and 
No other combination of the standard tests would have improved 
appreciably on this. Her correlation with T2 was :312, hence it 
Was' clear that she was not merely assessing general ability from 
йош education and occupation, but was effectively judging the 
atacter traits and interests relevant to the job. 


Validity of Psychologists’ Interview Judgments 
ee he superiority of psychologists’ interview judgments both to 
int кав by a board of naval officers and to predictions based on 
elligence tests was indicated in a study of 503 R.N.V.R. cadets. 

si d E majority of these were seen by Wilson, who used the 
ps he 71 outlined above, and the remainder by other Admiralty 
eue тз. T2 scores were available, but not the reports, 
е, ly prepared for the Admiralty Board on the ERI. 
as Sane record in the Navy. The boards, on the other hand, had 
candid. as these reports, the psychologist’s reports and T2s. Only Ke | 

езе al were allowed to proceed for training by the boards an 9 
Psych, i per cent. passed. Of the 176 favourably graded by t К 

y enologist 71 per cent., and of the fifty-two unfavourably grade 
`5 Tx Cent., passed corresponding to a tetrachoric correlation 
Were М None of the candidates unfavourably regarded by E 
cortelatio, for training, its predictions cannot be ехргеззе Rn 

e. B Ion, and the two percentages 61-4 and 71 are aot eomr i 

Psych E if all candidates had gone forward, the anes m 

at would almost certainly have been more mar e 
a validi Within this group of 228 on the basis of T2 alone g 
idity coefficient of only :24. 
quA Small W.O.S.B. investigation of officers who suffered 


PSychi4..: i 

үш breakdown, psychiatrists, psychologists, tests, and a 

1541 board members were compared. Eighty-nine cases W 
chologist's judgments 


ы. Since s 

Corres the agreement Ч the board's and psy ee) i 

еге SD toa eiu а ra it follows that the psychologist 1 Бае 

9f c. Шааке only in a highly Selected group. Thus in a less CES d 
2 is al: ates his validity coefficient would certainly rise. The low va up of 

415 canes largely due to selectivity. Among an earlier, unselected, 210! 

!dates its validity was *70. 
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traced who broke down within 13 years after passing their boards, 
and the judgments made at the time of boarding were looked up. 
"Neither their final board gradings nor the president's, or M.T.O.s’ 
opinions of them were any poorer than of normal acceptances. The 
psychiatrists had recommended the acceptance of 71 per cent. of 
them, but had given a significantly larger proportion of adverse 
reports than to normal candidates. The psychologists (sergeants) 
had not seen the cases, but had studied their “pointers” material 
and had actually recommended the rejection of a larger proportion 
than the psychiatrists, namely, 48 per cent. Since, however, the 
psychologists tended to give more adverse reports on normal can- 
didates also, the validity of their predictions was probably much 
the same. Intelligence test results were also sub-normal, 14 per 
cent. instead of the normal 6 per cent. having an O.I.R. of 4 or 
below. No one would expect such tests to predict breakdown, but 
in this investigation numbers are insufficient and the data too 
Incomplete for it to be proved that psychiatrists are better than 
psychologists, or vice versa, or that either are better than tests. АП 


three, however, are significantly better than “Jay” interviewers ОГ 
observers. 


Validity of Psychiatric Judgments 
. In 1941-2 several experiments were carried out by T. F. 
Rodger, Wittkower and other psychiatrists into the assessment of 
officers who were attending the Company Commanders’ School 
near Edinburgh. The Commanding officer and his staff were able 
to provide exceptionally thorough descriptions of the personalities 
and future promise of these officers at the end of their training. А 
psychiatrist interviewed each man for an average time of just under 
an hour, and wrote an independent diagnosis. On comparing the 
Psychiatric and the school’s opinions close agreement was claimed 
in eighty-fiveoutof 100 cases, Ina similar study of 223 officers certain 
intelligence tests were also given—Group Test 33, Matrices and/or 
Army S.P. Test 15. On matching the two sets of reports close agree- 
ment was claimed in 52 Per cent., substantial agreement in 36 pet 
cy in 6 per cent. and divergence in 6 per cent. 
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| Elsewhere Bowlby carried out a similar, but better controlled, 
Investigation of thirty-six O.C.T.U. cadets, to whom he gave the 
ЕН. test and Matrices and a twenty-minute interview. He 
Predicted their actual O.C.T.U. passing-out grades correctly in 
twenty-eight cases, and was seriously incorrect in only two cases. 
At that time, however, the cadets were very heterogeneous, having 
been selected by old-type interview boards, and it was found that 
Ше intelligence tests alone would have caught ten of the thirteen 
men who failed training. Thus, both the psychiatric interview 
and the tests achieved validity coefficients of around 80. 

In 1943-4 psychiatrists took part in the selection of Army 
Parachutists and, in order to gain greater familiarity with the job, 
themselves underwent parachute training. Brief interviews and a 
Shortened version of the W.O.S.B. “pointers” procedure were 
used. The trainees were graded 1 to 5 for suitability. On following 
“Р 1,492 cases, of whom 20 per cent. failed their first stage pf 
pining, it was found that 3 per cent., 7 per cent., 10 per cent., 

3 per cent. and 46 per cent. of men in the psychiatrist 5 five 
Brades failed, This corresponds to a validity coefficient of -58. But 
Er lowing up a year later the agreement between the grades and 

H Wastage (on medical, disciplinary or training grounds), or 
BE notion to N.C.O., was less close, corresponding to correlations 
ctween -2 and -3 (cf. Expert Committee, 1947). 
т 5 small experiment in the R.A.F. is reported by Rollin пөш), 
th enty-five W.A.A.F.s who failed trade training for reasons a i 
c dullness, and 100 normal controls were interviewed, an en 
Sy, made of Gillespie's scheme (p. 181). The most diagnos 
Mptoms appeared to be: 


Failures | Controls 
per cent. | per cent. 
Hi n 
Nerva? of instability in family 3 - 3 Я A a 
Pression’ poms before enlistment (morbid fears, de- 56 Е 
X i eee Me ee m t m » 
пуа nervous o РИКС d 25 3 
ersonalin Suggestive of neurosis .. ro D 2 
uate) У difficulties (over-aggressive, passive or ina ч a ; 
Unsettled ы? hyness, over-dependence or worrying - - M $ 
d home (e.g. ч p 
cupational ieee to uncmiployment) : : Ч | 
Thi i 
7а и 1 1 оша олу 
е Dalysis is, of course, ex postfacto, and its validity с 


Sats Т Кеа 
Proven if it was tried out on fresh cases before their training 


P.S.—6 
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started. The relation of neurotic disposition to pee н а 
ency was shown very clearly by Reid. Either the medical о: ud 
Reid himself interviewed 200 airmen on arrival at a RAF. sta d 
and they were followed up through a tour of operations. Two o 
more neurotic symptoms were found to occur in 20-4 per cent. 
those who had accidents ог became casualties, and in варе с 
of those who developed psychiatric breakdown, but only in 54 ЫЗ 
cent. of those who successfully completed their tour. This one 
sponds toa tetrachoric validity coefficient of -+ -55. Finally, a lar ge 
scale trial of the predictive value of psychiatric interviews ш, 
carried out in the R.A.F. towards the end of the war. Of the а 
men seen by two psychiatrists, only a few could be followed үй t i 
operational stage. The main finding was the extent to whic 


the validity of the judgments depended on the personal flair of the 
Psychiatrist (Williams, 1947), 


Conclusions 


of the interview yi 
metric approach 


Н D» d 
given to the qualities to be assessed and the methods ‘used, an 


H zl H H H $i 1 а 
when a certain amount of objective information is available as 
Starting point, the consistenc: 


I 
Рег cent, except Perhaps, when the candidates are abnormally 
heterogeneous, It į 


judgments with test scores than from either dbr 
nd psychiatric interviewers are almost RET у 
gically untrained persons shee 
cers in the Forces, It is doubtful whet 


i : 9.0.5 (even when carefully selected and үги) 
Constitute a valid vocational technique, Probably some P.S.O.s 2 
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quite.as good as psychologists and psychiatrists, but there are such 
large variations that it would seem wise to keep their diagnostic 
functions to a minimum and to rely to a greater extent on psycho- 
metric predictions whenever possible. None of the evidence that we 
have collected suggests that their subjective summing up of past history 
‘and their personal judgments are capable of covering all the relevant 
factors in vocational classification omitted by tests. 

As shown in the previous Chapter, the interview is essential for 
checking the correctness of questionnaire entries, for giving infor- 
mation, for matching supply and demand, for improving morale*, 
and often because it is a vested interest. Doubtless it has valuable 
applications also in treatment, counselling, research, etc., and in 
providing hypotheses as to the occupational relevance of various 
Symptoms—hypotheses which should be subjected to empirical 
verification. But we are forced to conclude that its value as a diag- 
nostic technique is, as Hunt, Wittson and Harris (1940) put it, 
à matter of economics. Although thorough psychometric pro- 
cedures might usually be more accurate they would involve such 
extensive and complicated research, and be so costly to apply, that 
they may be impracticable, in addition to being unacceptable both 
to candidates and to employers. The interview has the tremendous 
advantages of inclusiveness, speed and flexibility. It takes into con- 
Sideration a much wider range of factors than can readily be tested 
(though admittedly many of these are irrelevant and most of them 
liable to misinterpretation). It requires no apparatus, not even 
printed blanks. In the U.S. Navy, according to Hunt, interviews 
averaging only three minutes in length, served for screening 
Psychiatrically suspect recruits. It was found that a single psychia- 
trist could deal with 100 cases a day, though many could not 

‘keep up this pace for long. Even with ten- to twenty-minute 
Sessions, an interviewer is, likely to cover fifty men in no more time 
than a téster would take to give, score and weight the appropriate 
Score combinations on, say, a two-hour battery, which would have 
required months of technicians’ time to prepare. True, these inter- 
Views might make more mistakes than the objective procedures, 
but the sufferers (whether employers or employees) tend to resent 


* e . 
x For example, Closson and Hildreth (1944) have shown that recruits who 
Receive a five-minute psychiatric interview, designed to discover their “weak 
Pots" and to give advice accordingly, make better adjustments in the Forces 


than comparabl i 
methods, P fable men who have merely been selected by impersonal test 
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errors of human judgment much less than they do errors made by 
impersonal instruments. 

The interview again can readily adapt itself to the exceptional 
case, e.g. the illiterate whose test results or questionnaire responses 
are quite misleading. It can be applied in employments which 
involve small numbers, where adequate validatory evidence for 
psychometric procedures is unobtainable, and in such fields as 
managerial, officer or air crew selection, where it is particularly 
difficult to devise tests of character or temperamental qualities. 
Finally, when the requirements of any employment alter, the inter- 
View can be adjusted at once, whereas the psychometric approach 
involves a complete re-investigation of the validity of all the poss- 
ibly relevant test scores and biographical items, a fresh statistical 
analysis and preparation of a fresh weighting system. As we have 
seen, such .re-investigation cannot properly be carried out on 4 
selected group; it is hardly feasible to work out minor adjustments 
In a battery while that battery is in actual use for selection. For all 
these reasons, the interview rather than tests must remain the 
prime instrument of vocational classification, and vocational 
psychologists are justified in spending as much of their time 0n 


attempts to improve the technique of interviewing as on devising 
more valid tests, 


CHAPTER X 
PRINCIPLES OF PSYCHOLOGICAL TESTING 


Abstract.—A psychological test, by presenting a standardised task 

or situation, elicits a sample of the testee’s behaviour which can be 

objectively scored and compared with norms of performance, and 

which has been proved to be predictive of future occupational or 

other behaviour. To regard it as measuring particular traits or 

abilities (the “naming fallacy”) is often misleading. Its content is 

best determined by a combination of systematic introspection, 

empirical validation, and factor analysis. The types of tests found- 
most useful in vocational classification are group paper-and-pencil 

tests for measuring the factors underlying a large number of jobs, 

information or trade knowledge tests, and supplementary “minia- 

ture situation” tests, each closely analogous to some important job. 

Given a standard battery of group tests, the planning and con- 

struction of new tests is guided not so much by job analysis of the 

aptitudes involved as by gauging how far this battery needs to be 

supplemented, and by considerations of practicability, i.e. the 

availability of materials and time for devising, giving and scoring, 

etc. The main steps in such construction are outlined. Adult 

tests are generally calibrated (provided with norms) in terms of 
percentile levels, rather than in mental ages, I.Q.s, or “critical 

scores." There are great difficulties in indicating the relevance of 
Scores, especially on a battery of tests, to particular jobs. So-called 

wastage coefficients” and minimum standards are unsatisfactory. 

. Tables or histograms which show the optimum score levels on the 

most valid tests, and the probability of job success at each level, 

are advocated. 


The fundamental method of science is observation of the way 
things (living or non-living) react in specified situations, and this 
is the method of psychological testing. The tester does not deduce 
People's traits or abilities from bumps on their skulls or from their 
Physiognomy, nor does he rely on hunches or subjective impres- 
S1ons, but records their reactions to a standard task. At the same 
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i i alin; 
a test is in no sense a miraculous instrument for revealing 


otherwise unsuspected qualities; it is simply a pu 
process of observing behaviour which we use every dr а 
judgments about our acquaintances. As pointed out by 

(1940) this refinement involves: 


(1) 
(2) 
(3) 


(4) 


H : e react 
Standardising the task or situation, so that all testees г 
to the same task. : д 

i i orm 
Objective and unbiased recording of the testees’ perfo 
ance, usually in quantitative terms. | ith 
1 wit 

Availability of norms or standards by comparison v 


г ап Ье 
which the goodness or poorness of performance с 
assessed. 


Re ; : : cise 
Demonstration that the test is both reliable—i.e. a рге 


Ty. se 3 icts 
measure, and valid—i.e. that it actually measures or pred 
what it claims to do. 


Certain other points should be noted: 
(5) This view of tests incl 


(6) A test Constitutes onl 


Е : ts 
udes intelligence or educational ос 
where the recorded behaviour consists of answers to i 


lectual problems, as well as practical vocational tests, а 
medical tests of eyesight, hearing, etc., which may p. 
bearing on vocational suitability. It does not cover so-ca cit 
projection tests or other Clinical techniques whose objec 
to provide the psychologist with insight into the NU 
personality (cf. Thematic Apperception, p. 57), or to c We 
im to judge the testee’s methods of work (cf. AN illed 
Struction, p. 32). Such instruments are valuable in sk ing 
hands, but are not Strictly tests, unless objective scor 
methods are applied, s 
y a relatively brief sample, or e 
r. To make sure how well а man itin 
ation we should need to let him try de 
€ this would take so long we ешр, 
f tests, It naturally follows that e sive 
priately chosen battery of tests, WI Eo 
5 of proficiency at the DE UP d ro- 
our elicited by the test and vocationa Phat 
pend опа multiplicity of factors, WT. 
Y à certain statistical probability ш "t the 
and such scores will succeed or fail. there 
alidity the odds may be very high, but 


samples, of behaviou: 
do in a certain оссир; 
real life, but becaus 
short cut method 0; 
nor even an appro 
Perfect prediction 
the sample behavi 
ciency itself de 
_ there can be on] 
obtaining such 
test has good y 
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' "manual dexterity,” “visual discrimination, 
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can never be complete certainty about any particular 
individual. 

So far as possible a test should be named in terms of the 
operations it actually involves, rather than in terms of 
traits or abilities which it is supposed to measure. It is 
scarcely possible to discuss or analyse vocational suitability 
without bringing in personal qualities like. intelligence, 
attention, dexterity, persistence, etc.; but we should realise 
that these are hypothetical constructs. Such terms, which 
are intermediate between the observed test behaviour and 
the predicted future behaviour, are often unnecessary and 
misleading. The “naming fallacy” has been the bane of 
vocational psychology, witness the ubiquitous use of : 
? “attention to| 
details," and the like. Factor analysis investigations indeed 
show that there is so little overlapping between different 
tests of dexterity that it is doubtful whether it should be 
credited with any general and independent existence. Such 
analysis does, however, suggest that certain abilities are 
sufficiently consistent and distinctive to constitute useful 
concepts for the psychológist's vocabulary, and these are 
usually denoted by letters such as g, k, or v, rather than by 
names of dubious significance such as intelligence, spatial or 
verbal ability. 4 


There are, in fact, three methods of deciding what a psycho- 
logical test measures, all of which are usually employed: 


(i) 


(i) 


The method of a priori or theoretical analysis. Inspection of 
a test merely gives us what is called its “face value,” which 
has little or no scientific worth. The collation of systematic 
introspections about the test from several trained observers 
may be more useful, but even the views of psychologists as 
to the physical or mental processes involved in taking the 
test may be invalid when the prospective testees are children, 
or adults of sub-average intelligence. Chapters XII-XV 
give examples both of tests which failed to measure the 
abilities anticipated, and others which measured unsus- 
pected abilities. : 

Follow-up evidence, i.e. the comparison of test results 
with success or failure in various jobs. Though this is 
the most scientific method, it too has its weaknesses—the 
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educational type (v : ed factor) and the ence investiga- 
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group factors. The former includes p mes and 
usually overlapping, verbal, numerical, imary school) 
Secondary School (as contrasted with pri ical spatial, 
abilities. The latter group includes ere A The 
informational, physical, manual, and other s abilities 116 
extent to. which a test depends on any of pae: ressed by 
its factor loadings or Saturations, may be pne be 
correlation coefficients*. Such figures should, ce they are 
accepted only with the greatest caution, m e of ability 
largely dependent on the heterogeneity Ok Та pe of tests 
in the testees studied, and on the pun frequently 
Chosen for analysis. Further investigation euer and, as 

reveal the presence of additional group factors, A 
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vidual testing is possible, and as big numbers are needed for 
trying out and validating any test (whether it is to be rarely or 
frequently applied), the tendency among vocational psychologists 
1s to convert more and more of their instruments into group form. 
Even the interview, for example, is partly displaced by the bio- 
graphical questionnaire which can be filled in by a group under 
supervision. Mechanical assembly tests employed in the Army and 
A.T.S. were adapted so that eight testees could be dealt with 
simultaneously. With sufficient ingenuity there is no reason why 
the majority of performance and apparatus tests should not. 
become automatically self-scoring. Another reason for this tend- 
епсу is that fully qualified psychologists cannot afford to spend 
much time on testing; tests, and their scoring, must, therefore, be 
simplified as much as possible, and the personal element in indi- 
vidual testing reduced, so that accurate testing can be done by less 
highly-trained individuals (cf. U.S. War Department, 1946), 
Paper-and-Pencil v. Practical—The former includes, besides 
verbal intelligence, educational or trade knowledge tests, non- 
‚ Verbal g and spatial judgment tests based on abstract diagrams, 
and others—particularly in the mechanical field—based on 
Pictures. Some tests are intermediate—for example, morse or code- 
learning tests—involve the practical discrimination of auditory 
patterns and written responses. Owing to the difficulties of con- 
Structing reliable apparatus or performance tests in war-time, and 
of Maintaining sufficient sets in standard condition for application 
to thousands of recruits a week, there was a tendency to rely very 
largely on paper-and-pencil tests. This occurred, too, in America, 
where the difficulties were less acute. Commenting on the same 
tendency after the first World War, Hull (1928) surmised that a far 
greater variety of behaviour could be sampled by performance and 
apparatus tests. Admittedly there may be some danger of handi- 
Capping the “handyman,” or the recruit with manipulative skills 
who, perhaps through no fault of his own, left school early and is 
inefficient in reading and writing. Whether we are favouring the 
bookish” man in all jobs, both in the Forces, the Civil Service, 
the police, industry, etc., is a problem that needs serious investiga- 
Чоп. But the danger should not be exaggerated, since “practical- 
ness” is another rather dubious construct or hypothetical trait. 
Certainly a man may do better on a performance test of intelligence 
than on a written one, or ona manipulative assembly test than on a 
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: i = vet of the 
pictorial mechanical one. But there is e e ie acest ie 
superiority of such a man in engineering or bcm rM. 
that the introduction of practical tests may not т а Ж 
the economic standpoint, though, nevertheless, о е w—Á 
the grounds of “sales value,” i.e. attractiveness bot 
to employers. ы M . uM le” 
ИР tests may be subdivided into id d 
and “non-expendable.” In the former the test blank itse Ka 
in the latter answers are written or (as in American machine к: 
tests) marks are made in the appropriate positions on pie 
answer sheets, and the test blanks themselves can be use s die 
times over. Non-expendable tests are often preferred es ly 
Save paper and are more easily scored, but they un ae де 
impose an additional handicap on duller testees. The ne -— 
inventive response type of test item is also more suitable ultiple- 
testees, and was largely employed in British tests because m ЫП 
сһоїсе ог selective-response tests and examinations are venti 
unfamiliar in this country. But it is difficult to construct inv 
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interest in, and aptitude for acquiring further skill in, mechanical 
jobs than did any tests belonging to the more conventional aptitude 
category. Similarly in predicting university achievement, it is often 
found that previous attainments and tests of such generalised study 
skills as reading comprehension are of more value than intelligence 
tests (cf. Vernon, 1939a; Eysenck, 1947b). 

Analytic, Analogous and Work-sample Tests —This classification 
of vocational tests overlaps closely with the previous one. Analytic 
tests of the elementary sensory and muscular functions believed to 
be involved in a job, or analogous tests which parallel the main job 
operations, are the generally accepted tools of vocational selection, 
since they are supposed to be uninfluenced by the unequal experi- 
ence or training candidates may have had. While numerous investi- 
gations favourable to such tests have been reported, particularly 
by German industrial psychologists, they do not bulk largely in 
most modern vocational schemes, for several reasons. 

(1) There is clearly considerable truth in the claim of Gestalt 
Psychologists that every industrial operation is a whole which 
is more than the sum of its elements. Proficiency depends 
less on the sensory-motor capacities covered by analytic 
tests than on the successful integration of such capacities. 
Analogous tests also often give poor predictions because the 
structure of the abilities which they test differs in some 
essential respect from the structure of the job itself. 

(2) The analytic or analogous approach is extremely susceptible 
to the “naming fallacy.” 

(3) Vocational classification usually has to cover so great a 
variety of different jobs that it would be impossible to devise 

! Separate batteries of tests even for the main ones, or to put 
all the candidates through several batteries. The only prac- 

, ticable approach is to test the common factors ог generalised 
abilities which enter into a large number of jobs by a limited 
battery of tests, and then, perhaps, to add one confirmatory 
test in each main job which can be applied to likely can- 
didates for that job. t 

(4) This approach is justified by the findings of factor analysis. 
In a heterogeneous population such as recruits or school- 
leavers, factors such as g, v : ed and k :m, when supple- 
mented by biographical data, cover a major proportion of 
the ground, and the addition of specialised tests for particular 
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jobs adds relatively little to the success of predictions. bs. 
may be more justification for applying batteries of spec E 
tests in vocational selection where =a form 
more homogeneous group (cf. Chapter - 
“Work-sample”™ tests appear to be differently interpreted d 
different industrial psychologists. Thus Viteles (1932) implies a 
work-sample testing involves giving the candidates some Pd 
liminary training on the job itself and deducing their Wr 
proficiency from their initial results or rapidity of learning. x. 
use the term of any trade test such as the dictation of a stan! lof 
passage at standard speed to test stenographic ability. M 
these were widely used in the Forces for evaluating the 
experience of recruits. ПОТА 
We may conclude that the most appropriate tests for voca 
classification are of three kinds: P M 
(i) Group tests for the main factors underlying proficiency; 
(ii) Work-sample tests and tests of trade knowledge; 5; 
(iii) Supplementary tests intermediate between the analogue it 
“and the work-sample type, which attempt to жеоо nob 
Were in miniature, certain essential features of the jo 16), 
covered by (i). Of those described below Agility de аў 
Morse Aptitude (Test 10), Asdic gramophone records The 
the N.LI.P. clerical test, come under this heading. - ing 
fact that these were to some extent influenced by e 
or experience did not greatly detract from their Mes Nis. 
because recruits with relevant experience were foran an 
So much more readily trainable (cf. Chapter УШ) ives 
because, as Pointed out above, attainment generally 8 a 
useful predictions of aptitude. Moreover, there Ms. in 
safeguard that the P.S.O. could interpret the test кы did 
the light of Opportunities. A high score on, say; Cle 


: tian, 
WE h : ivilia 
test Would be more significant in a recruit whose € 
job was non-clerical. 
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Start with a detailed job analysis of the methods of good a ic OF 
he job, to construct a large battery of analyt 
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Hull, 1928). We would suggest that this applies only in vocational 
selection schemes whose scope is limited to one or two particularly 
Important jobs, the applicants for which are fairly homogeneous in 
& education and experience. Certainly the psychologist concerned 
with vocational classification should try to study workers on the 
job, and should consult supervisors, training staff, etc., regarding 
the nature of the work, but with a rather different aim in view. 
First he needs to know as much as possible about the physical and 
social conditions under which the job is performed, and, secondly, 
he should try to assess what qualities are not likely to be covered 
by his standard tests and by biographical information. In radar and 
asdic Operating, for example, even when these have been given full 
weight, there are likely to be left over certain visuo-perceptual and 
auditory capacities which do demand special tests. Both jobs, too, 
appear to demand a lot of knob-twiddling and concentration of 
attention, but the experienced psychologist recognises that the 
effort to try to measure such qualities would probably be wasted. 
If, as in these instances, additional tests appear desirable (or if 
follow-up evidence shows that his predictions are insufficiently 
accurate), these tests are designed, not so much to measure hypo- 
thetical aptitudes as to duplicate as closely as possible the remain- 
ing important aspects of the work. Thus, in planning tests for asdic 
Operators (cf. p. 26) it was mistaken policy to list sense of pitch, 
Sense of timbre, auditory threshold, etc., and to apply tests like 
Seashore’s which claim to measure these. In fact, such tests merely 
acted as rather inefficient tests of general intelligence. But by con- 
structing gramophone records with asdic noises which resembled 
the discriminations needed in the job itself, which were also so 
Simple that their dependence on g was low, a distinct contribution 
Was made to the prediction of operating efficiency. 

In the planning of tests for vocational classification, there are 
two factors which are on the whole more important than detailed 
Job analysis, namely, the experience of the psychologist, and prac- 
Ucability. The psychologist needs to know what kinds of tests have 
Worked well in the past in selecting for similar jobs, and should be 
familiar with the evidence from American and other relevant 

Iterature, 

The economic aspects of testing have been recognised by many 
Writers, though hardly sufficiently stressed. Hull admits that 
Progress in testing is likely to depend as much on the reduction of 
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testing costs and the development of mechanical methods of 
scoring and. of weighting scores, as on the predictive value of the 
tests themselves: Viteles points out that it is not worth while 
embarking on a programme of testing unless a sufficiently large 
number of sufficiently co-operative workers can be ensured on 
whom to try out and validate the tests, and unless a satisfactory 
criterion of their proficiency is available. In the Forces, for example, 
it appears so difficult to obtain an adequate measure of officer 
ability (nobody knows, or no two authorities agree, as to just what 
constitutes a good or a poor officer) that it is hardly worth attempt- 
ing to devise tests of officer aptitude. Other important considera- 
tions include: 

(1) What kind of tests can be given and scored by the available 
staff in the available time, both at the experimental trial 
stage and in routine practice ? Most of the Service tests listed 
below are very short (ten minutes or less) becayse of the 
restrictions on testing time and because of the danger of 
recruits becoming “fed up” if over-tested. Even the effects 
of practice which would arise if recruits took several hours 
of tests are shown to be Serious, in the next Chapter. es 
much relevant information as possible should be obtaine 
from questionnairés and interviews which do not look like 
tests, even if, in fact, the answers are standardised and 

validated in such a Way as to constitute additional tests- 
Headmasters similarly tend to resent allotting more pur 
an hour or two to the testing of school-leavers. Promising 
tests of interests and of neurotic tendencies could not be 
introduced in the Army because their scoring (in the absence 
DII MM 
r materials be forthcoming Я 
future as well as at present? Can apparatus be maintained 5 
Proper working order? The paper shortage in the war сой 
siderably restricted the use of tests containing spatial Е 
mechanical diagrams and pictures, and the diversion 


almost all skilled mechanics into the Forces or into А 
ments production о 
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results in different places. A complex reaction time test for 
motor drivers (Chleusebairgue, 1939), “miniature situation” 
tests for layers and for radar operators, tests of ability to talk 
over the telephone, and tests of night vision (as distinct from 
dark adaptation) were others which were regarded as valu- 
able but impracticable. 

(3) Can the psychological and statistical staff cope with the work 
needed in devising, analysing and calibrating a new test? 
Is it likely to be sufficiently superior to older established 
tests to justify this expenditure of technicians’ time? 

(4) The range and general level of intelligence in the population 
for which the test is intended must also be borne in mind. 
Some tests which might be quite practicable at upper levels 
may need far-reaching modifications and experimentation to 
be got across to duller men. Without this they may merely 
measure g over again, or be highly susceptible to practice 
effects. A warning has already been given against separate 
answer sheets and selective-response items. Figures, dia- 
grams and drawings need to be large and clear; thus, here, 
too, lack of paper may be a limiting factor. 

_The Production either of a general or a supplementary test is a 
highly complex and technical business, and we can only touch on 
а few points here. When the purpose and nature of the test have 

сеп decided, suggestions for items should be collected from 
Several psychologists or other soutces, and they should be carefully 
Scrutinised and revised to eliminate ambiguities, to ensure tech- 
ШОП accuracy, and to cover the requisite: range of difficulty. А. 
qi €xcess of spare items should be included, and if alternative 
5 ms are likely to be needed, it is preferable to devise and try them 
md a simultaneously. Suitable instructions, for the testees and 
x8 the testers, must be prepared and time limits decided, subject 
ER in the light of preliminary trials. Usually at least 
Dunn arge-scale trials, on populations which number several 
i ndred and which parallel the populations for whom the test is 
ntended, are needed for the following purposes: 

(1) To see if the layout and instructions are adequate, and 
Whether-there are gross defects in any items. 

(2) To validate the test as a whole and to show that it adds 

‚ appreciably to other existing tests, or is superior to other 
Proposed tests. 
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(3) To establish the difficulty, also the consistency and/or 
validity of each item. | 

(4) To validate the final form or forms and determine its pre- 
dictive value in one or more jobs. 

(5) To find its reliability. } 

(6) To analyse the main factors it involves by comparison with 
other tests. 


(7) To calibrate, or determine its norms. 


Calibration 

Neither the obtained “raw” score, nor the percentage score on а 
test or examination provides a precise indication of the goodness ОГ 
badness of performance. The results of intelligence or educational 
tests among children are, therefore, often expressed as mental or 
educational ages. Though it may be useful to know that, say, E 
arithmetic of a poorly educated adult is only equivalent to the leve 
achieved by average ten-year-old children, age units are scarcely 
applicable to superior or even to average adults. Abilities increase 
or decline after fourteen years in so varied a manner (cf. Chapter 
XI) that no fixed standards can be determined. Intelligence 
quotients or I.Q.s are sometimes used with adult intelligence tests, 
for example, the Wechsler-Bellevue scale, but are disliked by 
vocational psychologists not merely because their derivation differs 
from that among children*, but also because comparable units {ш 
‚ For example, it might be more on 
mechanical comprehension or mors 
hanical quotients and morse quotients» 


Е : а 5 
convenient group, Thus, in the Navy different percentile norm! 
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on most of their tests into selection grades or groups (S.G.s).* 
Group A or S.G.I. represents the top 10 per cent. of scores, i.e. 
the outstanding. x í 
Group В or II represents the next 20 per cent. of scores, i.e. 
above average. 
Group C or III- and Ш — represents the next 40 per cent. of 
Scores, i.e., average. f 
Group D or IV represents the next 20 per cent. of scores, i.e. 
below average. е 
Group E or V represents the bottom 10 per cent. of scores, i.e. 
very low. 4 
If the same grouping was applied to the Terman-Merrill 
revision of the Stanford-Binet test the I.Q.s falling into these 
Various grades would be approximately: S.G.I 121 and ир 
S.G.II 109-120, III 4- 100-108, III— 92-99, IV 80-91, V 79 bal 
below. Such a system is easily got across to the layman, and has _ 
great advantage that the S.G.s or the percentiles for all tests 
Standardised on the same population are comparable. A 70th e 
centile score, for example, represents the same relative degree о; 
Superiority on Matrices or mechanical comprehension or morse 
aptitude. › 
Yet another form of calibration sometimes used in vocational 
Work consists of minimum or critical test scores needed for 
entrance to certain jobs. This is an undesirable system i male 
reasons. First, it neglects variations in supply and deman à 
Obviously the critical score can be raised when there are plenty o 
Candidates for a few vacancies, and must be lowered if the situation 


- 15 reversed. Secondly, such rigidity is essentially unpsychological. 


€ have shown that test scores constituted only one of the many 
factors influencing employment recommendations. xn be 
encouraged to select men with lowish scores who iae ое 
Outstanding qualifications, and to raise their standards if y d ү 
Dot. Thirdly, there are numerous technical difficulties : xing 
satisfactory pass marks (cf. McClelland, 1942), particularly w! 
Several tests need to be taken into account. 
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* Other types of standard scores are often used in ence bir appear 

offer no particular advantage. Stanines group scores into ИШ е А КӨШ: 

Our five SGs, А 7:24:38:24:7 per cent. split is often preferred to Pu. 
710, since it accords more closely with the normal frequency 


but it is less readily grasped by the non-psychologist. 
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Interpretation of Test Results 

Although we have rejected the notion of critical scores, we can- 
not shirk the problem of providing some method of indicating the 
relevance of test performances to jobs. Not only the P.S.O. but 
also the interested layman wishes to know how accurately or 
inaccurately a battery of tests will select, and at what level of Scores 
candidates are likely to succeed or fail. The only method which 
satisfies the statistician is that of correlation and regression соећ- 
cients, but, unfortunately, it is hardly intelligible to persons with- 
out statistical training, and is often scarcely practicable. Most of 
the proposed alternatives such as “wastage coefficients,” “efficiency 
indices," etc., are very cumbersome and tend to be misleading. For 
example, a common procedure is to compare the distributions of 
test scores of men who turn out to be good and bad at the job, and 
to choose a pass mark or critical score which admits as many of the 
former and rejects as many of the latter as possible. This is repre- 
hensible since such distributions (unless obtained from thousands 
of cases) always show irregularities, and one is tempted to choose 
à mark which, by taking advantage of irregularities, gives the most 
favourable differentiation, In fact, the smaller the groups com- 
pared, the more optimistic is the forecast of the test's validity likely 
to be. Moreover, the numbers or proportions thus found to be 
Correctly selected or rejected depend to a large extent not merely 
on the goodness of the test, but also on the proportions of goods 
and bads, and on the Proportion of candidates which P.S.O.s can 
afford to exclude. As pointed out in Chapter VII, the higher the 
selection ratio, the better will the selection test appear to be— 


percentages, or to draw graphs or histograms, showing the pro- 


and all jobs, but it does Not readily indicate how low the P.S.O. Сай 
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go in selecting men unlikely to fail the course. Nor, when several 
tests are predictive of success, does it indicate how their findings 
are to be combined, or how much attention should be paid to each. 

In the British Army, S.G.s were suggested for each main job on 
the tests believed, as a result of job analysis, to be most relevant, 
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these being modified appropriately in the light of distributions of 
Scores among men actually engaged on the work, and of later 
Validation data. Thus, a minimum of S.G.3—might be set on the 
Clerical test, but no minimum on the mechanical comprehension 
test, for clerks, Though this system is generally applicable and 
adequately flexible, it was unsatisfactory for several reasons. + 

(1) No information was provided as to the likelihood of success 
or failure above or below the minima. 

(2) The imposition of different minima on different tests is not, 
in fact, equivalent to weighting the tests in accordance with 
the obtained regression equation. 

(3) Some P.S.O.s found it too complicated and preferred to rely 
on their own hunches, but others followed it too literally, 
with disconcerting results. For example, if no standard was 
Set on one test, say arithmetic, most of the weak arith- 

_Meticians would be herded into this job. 

It is the writer’s conviction that any such mechanical system of 
test interpretation would soon be liable to develop similar flaws. A 
Scheme which was developed in the Navy gives the distributions 
among passes and among failures (or men regarded by instructors 
aS Very poor), on two or three of the most relevant tests only. 
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Usually one test is the measure of all-round ability, T2 and the 
others are supplementary ones to which special attention should be 
paid*. As an example, Table XII lists the information for coders. 


TABLE XII.—SELECTION Test DATA FOR CODERS 
Period: July-Sept. 1943, N = 528 


T2 -  Testl Test 70 
S.G. Abstraction Dictation | 
Pass Fail Pass Fail Pass Fail 
= ae e p dum о, | =» тт 
AREE еу + 29} i 214 1 
B 41 24 49 3 401 3 
[e 25 7 19 7 37 8 
D 2h 2 21 2 1 i 
E E i = = E 5 
ст ЛЕА ИЕ АНГИ А [run ВА A «эй 
90 percentile — .. 125 16 35 
Median ,, 7 100 13 33 
10 percentile э 72 10 30 
| Se PERS 
Correlation with 
Practical training "543 "584 370 


The figures could equally well be presented in the form of a histo- 
gram. The pass columns give the percentage S.G. distributions 
(to the nearest half) among trainees found satisfactory. The fail 
columns give the proportional numbers. Thus, the actual failure 
rate was 11-5 per cent., corresponding to 123 fails for every 100 
passes. Percentile levels among passes are also shown. Thus, the 
P.S.O. can teadily see that As and Bs on the three tests have 


adapted to civilian educational and vocational classification Pe 

cedures, For example, similar tables might show the adequacy 4l 
EXE HRS 

a child's scores on general intelligence, educational and mechanica 


multiple Tegression equation could be obtai by Т2 plus 
ined by T? alone, or by 
one or two Supplementary tests equally Weighted, y $ 


CHAPTER XI 
EFFECTS OF PRACTICE, AGE AND OTHER FACTORS 
ON TEST SCORES 


Abstract. Investigations of civilians (adults and children) and of 
recruits show an average rise, on taking a psychological test a 
ene time, equivalent to about 5 I.Q. points, apparently regard- 
ess of the proximity of the re-testing. Further re-tests produce 
Progressively smaller increases, hence the provision of practice 
sheets or of one or two preliminary tests, helps to even up testees 
Who possess different degrees of previous experience. The effect 
15 smaller in straightforward tests with ample time limits, than in 
choice-response tests or tests with complicated instructions, with 
ушер Most testees in this country are unfamiliar. Though the 
бсле effect of one test on another is about twice as great when 
jid are identical as when they are only partly similar in form or 
ent, yet any kind of schooling and the taking of examinations 
may produce slight improvements among adults. 
Е did not constitute sufficiently representative samples 
i € general population to provide definitive results on changes 
т test scores with age. However, the more rapid decline in abstract 
enes, spatial and physical abilities than in educational 
бе m was confirmed, and considerable evidence was obtained 
fGen declines among adults of initially low ability. Not only 
those ah but also intelligence appears to be better retained ру 
abilit; who use their brains" more. Again, mechanical-spatia 
110—5 show increases from 14 to 18 years; general intelligence 
Bi НУ rises only among those receiving further education, and 
Ucational abilities decline among those not receiving schooling. 


Small but consistent geographical differences in intelligence 
d areas which contain 


м between the east or south an: 1 t 
t Iderable proportions of Welsh and Irish recruits. Intakes into 
d three men's Services varied at different times, and average 
епова between Army, Navy and R.A.F. ground personnel 
pr e small. However, the Army certainly received the biggest 
Portions of low grade, and R.A.F. air crew large proportions of 
181 
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high grade, recruits. Members of pre-Service organisations (A.T.C., 
Sea Cadets, Army Cadets, etc.), also Scouts, showed superior 
qualities in the Services. But this was attributable more to the 
intellectual, educational and other traits of youths who join these 
organisations than to the training as such, which was shown to 
produce rather limited effects. Physical training courses were 
found to improve the scores of Army recruits with poor physique 
on certain physical and mental tests, but menstruation among 
A.T.S. recruits had no consistent influence on test performance. 
= —.. 

А уегу large proportion of recruits in the Forces and of adults 
in industry have not met an intelligence, or other psychological, 
test before, hence their scores are likely to improve considerably 
as they gain familiarity with such tests. Tt is important to determine 
the extent of such improvement, both because different recruits no 
doubt possess different degrees of previous experience, and 
because re-testing is frequently necessary when original scores, 
cannot be traced. We will first outline some of the published 
evidence on practice effects. 

Dearborn and Rothney (1941) summarise most of the American 
work and show, from their Harvard Growth Study, that practice 
effects are generally not very large, but that they do occur with 
Some—not all—group tests. They do not seem to be confined to 
any particular type of test material. With repeated testing the 
effects tend to diminish, i.e, the greatest increase is from the first 
to the second test. These authors also claim that practice on one 


test affects that test only and does not extend to other slightly 
different tests, ? 
Some, but not all of these results are confirmed by British investi- 
gations, and we would Suggest that the difference lies in the greater 
degree of “test-sophistication” among American children and 


| spreading to occur from any one test to other dissimilar ones. For 
example, Vernon (1938b) investigated test-sophistication among 
university-trained adults and showed that the study of tests 1n 
general and their make-up, and practice in taking various types of 
test, raised scores both on verbal and non-verbal intelligence tests 
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which had siot been taken before by as much as 8-4 per cent.* An 
American study by Bowers and Woods (1941) points in the same 
direction. They compared the intelligence test scores of over 1,700 
students who had taken none, one, two and three or more intel- 
ligence tests during their school careers. If the scores of the no- 
experience group are reduced to 100, the scores of the other three 
groups average 106, 108 and 110 respectively. Evidence is then 
Biven to show that the differences even up in the course of a year's 
Work at college, and that they are far greater among students from 
rural areas and small towns than among students from large towns. 
À plausible explanation for this is that the big high schools in large 
towns would make more use of new-type achievement tests, and 
that their ex-pupils would be more test-sophisticated. 3 
_ А. С. Rodger (1936) applied six parallel tests at fortnightly 
intervals to ninety-five British 11-12 year old pupils and claims 
that the average increase was about 1 per cent., or 1 І.О. point, 
Per test. His figures suggest, however, a rise of 3.8 per cent. 
between the first and third tests and thereafter no further change. 

he Moray House tests which he used have a fore-exercise Or 
Practice sheet, which probably minimises the susceptibility of 
the tests themselves to practice. Rodger also states that the 
tise was greatest (14 points) in brighter children of Т.О. about 120, 
and lower (3 point) in dull children of I.Q. about 80. Dearborn 
and Rothney make a similar claim. It is a nice theory that the most 
intelligent, because of their intelligence, improve most. But all our 
4с, cited below, shows the greatest rises among those scoring 
east. Dearborn’s and Rodger’s conclusion is, in fact, based on 
changes in test norms, not on rises among actual pupils. 
cRae (1942) likewise gave sets of six tests to small group 
Weekly intervals and confirms the diminishing effects of repeate 
Practice. He concludes that when testees vary initially in their 
Previous familiarity with tests, a single test will act as 2 sufficient 

Shock-absorber” to bring them all on a par. He also noted, when 
8iving parallel versions of a test, that the effects on Form B of 
actual coaching on Form A were no greater than the effects of 
Merely doing Form A in the ordinary way. 

& àve (1938) found that some types of test are much more 
atected by practice than others, non-verbal and spatial items 


* All sc ; : b converted into 8 
eu ore changes quoted in this Chapter have Pern cOn iteration 
Е non Scaie, roughly parallel to intelligence quonen = овет Ъу 15. 


сеп divided by the Standard Deviation of score 


s at 
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apparently being more susceptible than most verbal items. Finally, 
recent researches by Heim, Wallace and Carpenter showed that 
continued practice on a single test may improve scores almost 
indefinitely *. When nine W.E.A. students took the A.H.4 test 
eight times in successive weeks, there was some tendency to 
flattening out after about the fifth occasion, but some subjects con- 
tinued to rise until they attained nearly perfect scores. Nine seamen 
tested nine times on almost consecutive days gave similar results. 
These show the dangers of leakages of test material and of 
unauthorised coaching. It was noteworthy, however, that the 
correlations between “unpractised” and “practised” test scores 
remained extremely high. Thus a test may still be a valid test of 
intelligence after Practice, provided that all testees have had the 
same amount of practice. " 

Turning to work in the Forces, the Matrices test was re-taken by 
537 seamen in an entry establishment one to six months after 1t 
had been done at recruiting centres. The average rise was 4-7 points 
ог 8:6 per cent. But as the reliability of this test is rather low, the 
correlation between the two sets of scores being only -79, the 
alterations were irregular. Some men actually declined on the 
second occasion, and the total range of changes was from 25 points 
increase to 13 points decrease. A natural consequence of what 18 
called the regression effect is that very high scorers showed least 
improvement, very low scorers most. 


An experiment in the Army where the same test was re-taken by 
277 men after only one day yielded almost the same rise, and other 
later work suggested that Practice effects are much the same after 
several months as they are immediately. But a possible alternative 
explanation is that military or naval training, including the taking 
of proficiency examinations, also helps performance at tests, hence 
a rise after six months may be partly due to this and only partly 0 
recollections of the previous testing. Sometimes, of course, the 
training received is directly relevant to the abilities tested, €-£ 
mathematical or mechanical. This would account for the different 


reliabilities and re-test rises, 
‘The Army results listed in Table XIII were based on re-testing 


* From unpublished Work communicated by courtesy of the M.R.C.'s Unit 


p een in Applied Psychology, at the Cambridge Psychological Labora- 
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TABLE XIII.—PzncENTAGE INCREASES ON RE-TESTING AMONG ARMY AND NAVAL 
RECRUITS 


Percentage Rises 


Test 
Navy 


l Abstraction 


3:3 
2 Bennett Mechanical `. m 104 
Arithmetic Mathematics 21 TO 

or Squares va 8:0 
25 Verbal 33 = 
12 Clerical 66 = 
в Assembly + 60 = 
orse Apti . == 
16 Agilitys ^ c 72 E 


500 men, representative of the total intake, after eight weeks of 
Primary training (which involved little or no bookwork). In the 
Navy, however, 500 air mechanics were re-tested after six to eight 
months of mechanical training. In spite of the longer interval their 
increases on Tests 2 and 3 are far larger. Squares (Test 4) is 
Slightly larger, but Abstraction (Test 1, not taken in the Army) 
Was probably uninfluenced by their training, hence it shows only 
a small rise, 
The usual regression effect was observed on most tests. The top 
Per cent. of scorers achieved only a slight improvement or even 
declined on the second test, and lower groups showed successively 
greater increases, But in three tests— Verbal, Clerical and Squares 
(the latter in both Services)—men scoring in the 90th to 50th 
Percentiles at the first test showed as great improvement as those 
in the bottom half, Probably this is due to the positively skewed or 
almost rectangular distributions of scores on these tests. 
he total increase on 'Т? in the Naval group was larger than 
st 9n any component test, namely, 12-4 per cent. This is yey 
considerable, being equivalent to an average rise of nearly 1 S. 1 
‚уеп if half of it is attributable specifically to the mechanica 
training, it means that T2 is very sensitive to the practice effects 
of previous testing and of more generalised schooling. - 
, Some evidence indicates that the effects of taking a different set 
of tests before are smaller than the effects of doing the same test. 
ne average increase, shown in Table XIII on eight Army tests, 


t i 2 ^ ique 
am The high figure for Agility might be ascribed to improv ement of Bye 
ex; ng recruits in their first eight weeks of Army life. However, RUE 

Periment, an almost immediate re-test yielded a still greater impr 


nearly 12 per cent. 
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also on Matrices and Abstraction, due to taking the same test 
before (along with others) amounts to 5-3 per cent. But in an 
experiment described below (p. 197) the Naval T2 tests and the 
R.A.F.'s G.V.K. tests were taken by the same recruits a few days 
apart, and the practice effect on whichever battery was taken second 
was estimated as 3-6 per cent. This is similar to the 2i рер cene 
(2 to 3 points of Г.О.) admitted by Terman and Merrill when 
Form M of the revised Stanford-Binet test is given shortly after 
Form L, or vice versa. , 
Further consideration of Table XIII suggests that the biggest 
effects occur in tests which are most novel, or which have most 
complicated instructions. Squares and Clerical are particularly 
difficult to “get across.” The Arithmetic test, which is entirely 
straightforward and familiar; has the smallest increase of 2:1 per 
cent., and the Naval Abstraction test, whose instructions are 50 
simple that it is self-administering, is also very low. This is borne 
out by the following investigations. ‘ А 
Several versions of a mechanical comprehension and information 
test, and of a mathematics test, were devised for use at r ecruiting 
centres, and were simplified in order to make them self-administer- 
ing. These were tried out on 1,400 recruits in H.M.S. Roya 
Arthur of whom 190 had taken a version of these tests а few 
months earlier at recruiting centres, while the remainder had 77 
Matrices. The former group showed no superiority on the gae 
matics section, and were only 1-5 per cent. superior ОП 9 
mechanical section as a result of their previous experience. z 
In an Army experiment, Matrices was given twice on em 
secutive days with a forty-minute time limit to 270 recruits m 
an alteration of under 6 per cent. was found, contrasting with A 
figure of over 8 per cent. obtained with a twenty-minute ШШ 
The longer limit presumably allowed the testees to get more из 
to the unfamiliar task*. Other studies were made of dictation an< 
spelling tests. Five passages were tried out on 647 seamen in an en 
establishment, each class getting a different set of two passages p 
different order, The average scores on the second passage Ve 
only 12 Per cent. superior, and this is perhaps attributable = 
familiarisation with the dictator's pronunciation. In anot 
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research, four groups of fifty recruits took five E uae 
Succession. One was the A. T.S. Spelling, two involv ed dictati А 
Sentences іп which опе difficult word had to be written down, s à 
two were of a novel type. Each item consisted of bu Е: 
Synonyms, опе of which was incorrectly spelled. НЕ оз 
identify and re-write the wrong words. For example: 
STREET RODE be f the tests 
No practice effect whatever could be discovered in any В ws 
except this last, unfamiliar, one where it amounted to а 
cent, : n 
nhe following appear to be шеп conclusions arising out of 
the investigations we have described: БТР). | 
(1) Test. scores are seriously disturbed if different еа 
had different amounts of practice on the mum к ad 
(2) The effects of taking other similar tests are smalle / 
Still very considerable. m 
` (3) Such Eu tends to show diminishing ae y AEE 
(4) The effects of practice are probably lasting, Sree point 
Breat after some months as after a few days. 
requires much fuller investigation. it is 
(5) Am from training in the subject matter of i dang of 
Possible that any kind of schooling SES i: appreciable 
examinations (especially new-type ones) have ap 
effects. east 
(6) The effects tend to be greatest among Wa m 
accustomed to tests, and possibly to bookwor A үш, 
\ (7) The effects are much smaller in ыан ed e 
response tests with simple instructions an E 3f 
allowance than in unfamiliar tests with е Vere and 
Structions, also in verbal than in most non 
Spatial tests. ü Ver. 
(8) They can be reduced by adequate fore-exero ses. er 
in view of No. 3, testées with different en one or two 
can be brought to a more even level by taking 
reliminary tests. ; lesirable, 
(9) ij euet with the same, or parallel, tests is unden pe 
but when unavoidable distinct norms or stan "m the same 
Provided. So far as is known re-test scores Пао Uie uy 
Validity as original scores, but this, too, req 
Study. 
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Age and Other Differences i 
Many results of genéral interest might have been anticipated 
from the large-scale testing of recruits during the war, for example, 
comparisons between the three Services, between men from 
different parts of the country, or from different occupations, and 
so forth. Such enquiries are, however, much more complicated 
than they appear at first sight since age, occupation, recruitment 
policy and other factors that influence test scores are so interwoven. 
Eighteen-year-old recruits were probably fairly representative of 
the population as a whole, at least in 1942-3, though even at this 
age a considerable number had their call-up postponed, since they 
Were in reserved occupatións; and these were mostly in skilled 
trades or professions and, therefore, of superior intelligence. A 
very small proportion of men of low medical category (mostly 
below average in intelligence), of psychiatrically unfit and of mental 
defectives, were rejected. The composition of older batches of 
recruits was extremely variable owing to the Ministry of Labour's 
reservation policy. In one month, for example, large numbers of 
20-25 year old policemen were de-reserved and intakes into all the 
Services were of outstandingly high quality. In another month 
there was a big proportion of 30—40 year builders, and the quality 
of intakes was lowered partly because the average for building 
tradesmen and labourers is a little below that of the general 
population, partly because test scores tend to decrease with аре. 
Again when an occupational group was partially de-reserved, 
employers would naturally tend to hang on to their best and 
presumably most intelligent men and to release the duller ones for 
the Forces, Thus observed age differences were distorted by differ- 
ences in the occupational make-up of groups of recruits of different 
ages, and occupational differences were distorted by the age 
groups that happened to be called up during a certain period. 
Psychologists were, of course, unable to control these factors, OY 
to hold them constant, as they would do in a laboratory expeti- 
ment, hence it was almost impossible to disentangle their effects. 
Pre-war research, though often on а small scale, had established 
the following facts by taking special precautions to secure repre- 
Sentative cross-sections of the population: 
(1) General intelligence increases on the average at a steady rate 
up to about 12 years. The rate of increase then declines unt! 
à maximum is reached by about 15 years, though on some 
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tests there is little if any increase after 13 and on others rises 
have been reported even beyond 18-20*. 

(2) Between the twenties and sixties there is a progressive 
decline on tests of g involving abstract reasoning and speed 
of mental manipulation, though on other tests of what has 
been called “crystallised” intelligence, such as vocabulary 
and information, the level is better maintained (cf. Cattell, 
1943; Brody, 1944). 

(3) No one individual necessarily conforms closely to these 
average trends. “Longitudinal” studies of particular chil- 
dren often show great irregularities of mental growth, with 
spurts and plateaux attributable partly to emotional adjust- 
ment or maladjustment, partly to the stimulating or inhibit- 
ing effects of the child’s home and school environment (cf. 
Fleming, 1948). The same may well be true of adults, but 
longitudinal studies are much more difficult since an adult 
cannot be re-tested many times without his scores being 
affected by “sophistication,” and by his attitudes towards 
the investigation, e.g. growing hostility. 

(4) Educational attainments tend to be forgotten rapidly after 
leaving school, except in so far as they are used in daily life. 
Thus, Norris (1940) finds that scores on linguistic tests may 
rise till about 40 years, but arithmetic achievement declines 
In most persons other than clerks who practise arithmetic in 
their job. The performance of adults on intelligence tests, 
according to Lorge (1945), is also affected by education 
beyond 14 years, though, as Garrett (1946) points out, 
results on a verbal test do not necessarily mean that intelli- 
gence itself alters. Adults with a university education, tested 
at 34 years, were about two years superior in mental age to 
others of the same intelligence level when aged 14 years who 
had received no secondary or university schooling. An 
investigation by Miles (1932) also appeared to show an 

| earlier decline in intelligence among adults who had only 

had elementary schooling, and Cattell (1943) claims that 
Superior occupational groups retain their intelligence better 
than lower ones. 

Several studies in the Forces helped to supplement these 


* 
15, Dearborn and Rothney (1941) claim increases up to much later ages than 
| €y are assuming an indefinite continuation of schooling. 
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conclusions. The abstract reasoning ability which is раш 
affected by age is typified in the matrices test. Т hus, it v ї widely 
what unfortunate that this test should be the one eRe. pu 
applied in the Royal Navy, Army and A.T.S. When i nif aged 
introduced the Navy happened to be recruiting men b. 
30 and over, and the norms which were established then, an note 
could not readily be altered, were extremely faulty ш ы 
to later intakes consisting mostly of 18-year-olds. The S. á 1022 
chosen to divide the older population into 10, 20, 40, adus ЫШ! 
cent. groups, but with more representative recruits t ILIN ' 
12 to 14 per cent. of S.G.1s and only 1 to 5 per cent. А 
cf. Table XIX)*. ined 
| A typical set Я correlations for nine tests with age de. E 
in 1942 in a group of 578 Army recruits well spaced outi 3 Е, 
40 years; these are listed in Table XIV. All of them, it wi 


омо ДЕМИ 
TABLE XIV.— ConnstATIONS BETWEEN Test SCORES AND AGE AMON 


Rzcnurrs 


Test Correlation 
ёл ? iub 
Progressive Matrices E e e Lo oe 


10 Morse Aptitude 
16 Agility s 
Squares $m 54 
2 Bennett Mechanical | | 
2 Clerical Bc 
3 Arithmetic 
8 Assembly rh 
17 Messages (Verbal) 


ich 
i : E F es whi 
are negative, even with tests such as arithmetic and messag 


slowly, so confi 
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tion for all tests was —-137. When the 30s and over were con- 
trasted with 295 and under it was —:149, but when 35s and over 
were contrasted with 34s and under the average coefficient rose 
to —-288, 

The A.T.S. differed from the Army in that recruits of, say, 
25 and over were often of better quality, drawn from higher-grade 
Occupations, than the younger ones, since more of them were 
volunteers, All their correlations with age, therefore, tended to be 
More highly positive or less highly negative than those of men. In 
Опе representative sample of 200, they ranged from —:100 for 
Matrices and —-070 for Squares to 4-:147 for Verbal (Test 25), and 
+180 for Spelling. This —-100 for Matrices is almost as cer- 
tainly too small, as the —-340 (in Table XIV) is too high. Hence 
the correlation of —-238 obtained among 90,000 naval recruits, 
Whose occupational make-up was held constant, may well be about 
right (cf, Table XV). 


Differential Decline with Age ; 

Since age changes differ on different tests it should be pointed 
Out that any composite intelligence score, based on several tests of 
varied types (such as T2 or Summed S.G.) will have a somewhat 
different make-up at different ages. A man of 20 and another of 35 
May obtain the same Summed S.G., but the former is likely to 
Score better on Matrices and Bennett, the latter on Verbal and 
Arithmetic, 

That the rate of decline in g (as measured by Matrices, not by 
petal intelligence tests) is greater at lower intelligence levels was 
dicated bya survey of 90,000 naval recruits in 1942 (cf. Vernon, 
1947c), They were classified under twelve broad occupational 


_Aeadings, so that it was possible to maintain the same occupational 


Stribution at all ages. Table XV shows the names of the groups 
and total numbers, together with the mean scores of 16- to 19-year- 
olds (average about 18:0) and the mean scores of 20- to 40-year-olds 

Average about 30-0). The last two columns show the decreases 
‘tween the 16-17 and the 18-19 year groups, also between the 
~19 year groups and those aged 30 and over. In the first of the 

1 V columns the decline in the four most intelligent occupations 
5 tess than a quarter the decline in the three dullest occupations. 
Е. the last column the results are more irregular, possibly because 
the small numbers available at later ages, nevertheless, the 


Sy 
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TABLE XV.—Averace MATRICES SCORES AND DECLINES WITH AGE IN TWELVE 
E OCCUPATIONAL Groups 


o р Mean Scores at Decreases from 
lecupation No. 
16+ to | 16+ to 
180 | soo |f [Spy 
Cleks ..  ..  ,. | 11,608 434 | 391 | 039 | 507 
sel imis Workers zs 4,602 403 37:3 0°16 528 
oodworkers .. ра 6,705 39:0 6:2 0:25 3 
Precision Workers x 11,007 38:9 355 0:33 5:43 
Sheet Metal Workers |. 6401 | 374 | 351 | 049 2 
etail Tradesmen s 9,373 37° 34 " 3 
Machine Operators ss 5/936 He a 042 507 
Builders .. "07 П 9. 77 443 
HU s cu 1014 | 364 | 329 | 0-77 
T о у. 5,764 | 346 | зів | 051 | 466 
б PUN 5,367 | 306 | 306 | oas | 840 
Farm Workers |; 1! 2,406 | 334 | 305 | 1:39 | 51 
abourers + e | 13466 | 336 | 286 | 157 | 699 
E 
SM Бата | ara | gap | ous | 2888 


decline is more than twice as great among labourers and mates 2$ 
among woodworkers and sheet metal workers. The interpretation 
of these figures is dubious both because the 16-17-year-olds consist 
of volunteers for the Navy, whereas the 18-year and later groups 
consist mostly of less intelligent conscripts, also because 0 
unknown effects of de-reservation policy. It might be rash to con- 
clude that the general decline in g starts as early as 17. There 19 


however, no doubt of the statistical significance of differential 


rates of decline in different Occupations, and they appear to fit the 


hypothesis of more rapid decline among men who make least “use 
of their brains.” 


The same point is brought out in another way in Table хүр 


s 
Which shows the Percentages of successive age groups obtaining 


Very high Matrices Scores—50-54 and 55-60, together with the 


т. = F 
ABLE XVI. PERCENTAGE Matrices Disrriputions Амр MAN SCORES s 
DIFFERENT AGE GROUPS . 


Age (years) 


25-29 | 30-34 | 35-39 | 40+ 


035 | 015 | 0:04 | 069 
255 | 169 | 129 | 43.13 
52:99 | 4850 | 42°80 | 56-18 
4411 | 49:60 | 55:87 
8—1 5 748 


4060 | 5,351 | 7,577 | 550 


EA 
А ^ 3466 | 3374 | 32°40 . 
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numbers and mean scores for all occupations combined. In spite 
of the steady decrease in the mean with age and in the proportion 
of above average scorers (from 70 per cent. at 16-17 to 34 per cent. 
at 40 +), the proportion scoring 55 and over remains steady (or 
€ven rises) till 29 years, and only then starts to drop rapidly. Again 
the proportion scoring 50-54 remains steady (or rises) to 24 years, 
and then drops. While there may be some alternative explanation, 
ү certainly looks as if the most intelligent retain their intelligence 
ongest, 


Changes with Age from 14 to 18+ 

In an investigation of naval artificer training establishments, 
Where boys enter at 14 and are trained as skilled tradesmen till 18, 
S.P. Tests 1—4 and several supplementary tests were given to all 
Oys, including 300 of average age 154 and 250 of average age 18}. 
Although this population is far from representative of the general 
Average, being highly selected educationally, it has the advantage 
2 remaining stable in composition throughout the four years since 
(unlike the ordinary secondary school) scarcely any boys are 
eliminated once they have started the course. The gains in scores 
from 15 to 18 listed in Table XVII have been conyerted into a per- 
centage scale to make them comparable for all tests. There is a con- 


Т, Е 
у XVII.—Percentace INCREASES OR DECREASES IN TEST SCORES Der 
AND 18-- AMONG ARTIFICER APPRENTICES AND AMONG Bovs LEAVIN 
ScHooL AT 14 


Gains or Losses Among 


Test 
Artificers l4-yr. Leavers 
a 
Д Abstraction d T x +48 +10 
ennett M. i 10:0 5 
Se Arthmede es ушыр; 1з 
athematics 23 ES R +65 d zm 
qu e -L9:9 . zu 
от 12 (eia in A sie +19 
117 Memory forDesigns .. "m +55 E 
E echanical Information e +142 ER 
110 lectrical Information .. y T216 m 
Mechanical Models — .. = +40 = 


Siderable improvement throughout; and it is almost as great in the 
Ds Test g test (Test 1) as in mathematics—that is а school subject 
S Which boys receive a fair amount of instruction. As one would 
Xpect it is greatest in the information tests (117E, M, and 2) on 
P.s.—7 


\ 
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- account of the training, but it is very large also in Test 4, that is in 

spatial ability which is not directly trained at all. Бу one МТС 
During 1946—7 several S.P. tests were applied by o toda 

writers and his students at the University of Glasgow table 
groups of school children. One sample* included all i Е pi. 
14-year-old boys leaving school in January, 1947, keen E. 
1,200. Naturally this was a rather inferior sample, but t for neti 
on the T2 tests were compared, not with the Wer m 
recruits in general, but with the scores ofa group ies ME. 
rectuits who left Glasgow schools at the age of 14 in 193 decrease 
entered the Navy in 1942. The percentage increases rig Tana 
in this group over the 14-year-boys also appear in Table оса 
are very different from those found among boys of үш ligible 
ligence whose education was continued. There is only a neg! the £ 
increase in the all-round ability measured by T2 or E Too 
measured by Test 1. On educational tests there is a serio WP. 
On the other hand there is a considerable rise in wy s 
spatial ability, although it is unlikely that many of the id 
recruits had received any Systematic technical or trade t ithout 
It would appear then that k : m abilities rise with or Me s 
schooling from 14 to 18, that v : ed abilities drop and g оош 
fairly constant, unless they are stimulated by secondary sc e 
But the constancy in all-round ability measured by a Бае the 
as T2 is specious, and occurs merely because the d side’ 
Practical side roughly balances the loss on the educationa 


to 
z 5 found 
average adult standards, as obtained in the Forces, were 


tively- 
be reached by children of 13, 123, 13 and 154 years respec 
That is to say, 


the first three, 
of 14, but—jus 


er 
hae ‚ Furt 
14-year-old. on a performance test which involves mer showe 
analysis of the Abstraction, Spelling and Arithmetic resu t the top 
that the standards at 14 and 18 are just about the same a ts 
Я Есопот 
* This testing was part of a Survey undertaken by the Social an ered oe 
Research Department of Glasgow University. The boys were s ft 
60 schools, and iter i i 
Glasgow Corporation Child 
the testing and Scoring. 
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end of the scale. The best 10 per cent. of adult recruits do as well 
as the best 10 per cent. of normal 14-year-olds. But there is a pro- 
gressively greater decline at the lower end. The bottom 10 per 
cent. appear to lose 1} years on Abstraction, two years on Arithmetic 
and three years on Spelling. At 14 years the 10 per cent. poorest in 
spelling reached a maximum level equivalent to the average 11-year- 
old; whereas among recruits the corresponding level is only 8 years. 
It is little wonder that there were so many complaints of semi- 
illiteracy and appalling spelling among recruits when dull adoles- 
cents lose so rapidly the skills in which they were drilled till 14. 
The better retention among brighter men, who receive further 
schooling or who use their skills in their jobs, confirms our con- 
clusion as to the différential decline in abilities. 

We see then that the average performance of a group of adoles- 
cents or adults on psychological tests varies greatly with the type 
of ability tested, with age, and with any schooling or other training 
the group has received, or forgotten. This complicates tremend- 
ously the establishment of satisfactory test norms. A further 
corollary, since tests of g appear to be affected in much the same 
Way as educational knowledge by use or disuse, is that psycho- 
logists and educationists should investigate just what types of 
adolescent and adult education, and of occupational and avoca- 
tional experience, most effectively stimulate intelligence. Not only 
can it be raised during the ’teens by schooling, but also the inevit- 
able decline in adulthood can probably be retarded. 


` Occupational and Geographical Differences 


The striking differences on T2 and other tests between men in 
various Service employments are described in the next Chapter. 
Few large-scale studies of civilian occupational differences were 
Carried out, apart from the one whose results are listed in Table 
XV. This was the only one, too, where the age distribution was 
held constant in all groups. It may be observed that the range of 
Scores from clerks at the top to labourers at the bottom is rather 
Small, corresponding in terms of I.Q. to a range of about 110 to 
90*. Verbal tests such as Army Alpha and Cattell’s Scale III give 
4 much wider range than does Matrices. This bears out our 


* This assumes a Standard Deviation of 16. Cattell’s Scale III has a S.D. of 
С ad we assumed this figure our range would be perhaps 116 to 84. But 

attell’s range is still much wider, running from clerks with I.Q. 127 to 
Packers and sorters І.О. 78. " 
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contention (in the next Chapter) that occupational suitability, and, 
therefore, occupational level, depends on v : ed factor as well as 
on g. . 

The only regions of Britain studied were the nine very herera 
geneous areas into which naval recruiting centres are grouped. 
Small yet significant differences are indicated by the mean scores 
listed in Table XVIII. The first column of means gives the 


FERENT. 
Taste XVIII.—MzAN Matrices Scores or NAVAL Recruits From DIFFEREN! 


AREAS 
dean | Scores 
Area : No. Ма (ii) 
London, including Kent E. Anglia and Northamp- E i 
ton Ba z A Ade É d. \.. a 22,376 Зур 877 
Manchester, including Yorkshire Ж x [19:026 372 30:5 
Derby, including Lincoln and Nottingham m 5,496 36°5 
Southampton, including Sussex, Oxford, and 36°8 
Dorset .. nA Ha S * +» |* 7,108 36°3 
Newcastle, including Cumberland, Westmorland 30:2 
and Durham — .. ba a vr e 5,264 36°0 36:0 
Liverpool, including W. Lancashire and N. Wales 7,690 358 
irmingham, including Gloucester, Hereford, S 35:5 
Worcester, Shropshire and Mid-Wales Б 9,397 355 
Bristol, including Wilts, Somerset, Devon, Corn- 352 
wall and S. Wales г, os E. h 7,988 35:0 35:3 
Glasgow, including the Whole of Scotland 11,223 | 34:8 ; 
Zur PETLAS | BESE 


obtained figures, corrected only for age differences. But occured 
and area are closely linked. Glasgow is low partly because it са 
tains the largest Proportion of labourers, Manchester high becau 


Naturally а bigger range of differences would be expected 
between more homogeneous areas. It is absurd to group, €-£ 


and Glasgow areas, and it is not 
bottom of the list. 


Another comparison was afforded by the survey of Glasgow 14- 


` 
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year school leavers, though these groups are not, of course, typical 
of the whole 14-year population. The mean T2 for 400 boys at 
Roman Catholic schools, i.e. mostly of Irish descent, was 55:0, and 
for 771 boys in other schools 64-3. This corresponds to a difference 
In average I.Q. of about 7 points. 


Service Differences 


E The belief was widely held that the R.A.F. got most of the 

cream” of recruits, that the Navy also had more than its share, 
While the Army had to put up chiefly with below-average men. 
This could not be directly proved or disproved since no one test 
Was taken by all three Services. However, a fairly trustworthy 
"bridge" was built up as follows. The G.V.K. tests were given to 
а group of 552 naval recruits, half before and half after taking the 
T2 tests. (The two batteries gave an inter-correlation of 792.) 
Allowance could thus be made for practice effects, and correspond- 
ing percentile levels on the batteries were found. Another conver- 
Sion table between Matrices and T2 was constructed (their inter- 


. Correlation, in an unselected group of naval candidates being 


estimated as :80). Now Matrices had been given to all Army and 
A.T.S. recruits in 1942 as well as in the Navy. Table XIX shows 
the percentage S.G. distribution on Matrices for some 100,000 
Army recruits entering during July-November, 1942, for an 
equally large group of accepted naval candidates during January- 
September, 1942, for 3,759 ordinary seamen within the same 
Period, and for A.T.S. intakes during the whole of the same 
Year*. It will be seen that the intakes are closely similar in numbers 


TABLE -XIX.—Mararces DISTRIBUTIONS DURING 1942 IN THE Army, Navy 


AND A.T.S. 

S.G. Score Arm: Naval Seamen A.T.S. 

y Acceptances са 
———| 

er cent. er cent. per cent. per cent. 

1 464 12:3 Pto 11:9 12 143 

2 404+ + 23:6 279 3076 27-6 

3 29+ 49:5 441 40:8 393 

4 20+ 16:2 123 140 17:5 

5 19— 54 12 27 r3 


і 


1 e "These are Army norms; unfortunately naval and A.T.S. norms differed by 
B 18 2 points. Since S.G.s alone were recorded at Recruiting Centres, not scores, 
58 У the Army and the О.5. figures are the percentages actually obseryed. Тһе 
* Sul and А.Т.5. figures have been converted to Army norms, and are unlikely 
О be inaccurate by more than 0.5 per cent. 
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f S.G.1s, but that it is true that the Army received a much ааа 
ү n of S.G.5s or very dull recruits. In terms of -— 
UA difference between Army and Navy € d at mos 
2 points on Matrices at this period, or 5 points on E slightly 

We can probably allow that the Army intakes pis nc 
inferior to the population as a whole, and that naval en nos 
ones were slightly superior. Naval norms were, meee add 
based on ordinary seamen, not on recruits in general. T ee nic 
that some of the brightest recruits, entered as radio ns с an 
telegraphists, etc., and some of the dullest entered as d hy 
cooks, were omitted. Table XIX shows that seamen were of s pee 
lower quality than naval acceptances, though better be -—— 
recruits, and it is a reasonable guess that, in average score а «hall 
they Were-very close to the norm for the population as din the 

Probably, however, they were more restricted in range tha wont 

general population, both because very low Matrices ae "ust 

rejected at recruiting centres, and because of the omissions J 

mentioned, of T2 

Table XX gives the best estimates that could be made derate- 
percentiles in R.A.F., Navy and Army. Being based on mo 


TABLE XX.—T2 Levers IN R.A.F., Navy AND ARMY = 
Percentiles 
Service Date NO | 
90th 

R.A.F. Air Crew . T 1942 1,141 141 
R.A.F. Ground Staff г 1942 5,000 116 
Navy O.S. ., 5 vi 1942 1,000 108 
Navy O.S. > 


1944 | 3/384 132 

Army recruits p = 1942 578 | 100 | 
y recruits ie Ў. 1945 1,000 112 
sized groups they are less г 
Previous Table, and the 90 
less reliable than the 50th. 
in 1942 Army recruits we 
and it may be seen that R. 
to seamen. By 1944-5, ho 
more candidates at rec 
only raise its standard 
fairly representative о 


, 


: he 
epresentative than the figures КАТ, 
th and 10th percentiles are n2 that 
The table bears out our сойо б еа 
re only slightly poorer than nava erior 
A.F. ground staff were but little weet any 
wever, the Navy was able to rejec a 
Tuiting centres, whereas the Army emain 
S to a small extent. These figures E i 
f peace-time intakes, R.A.F. air crew» 
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contrast with ground staffs, were clearly very superior, and pro- 
_ vided some substance for the Army’s complaint that the R.A.F. 
and Navy got the largest share of high-grade material — ' 


Effects of Pre-Service Training 

Great stress was laid by the Services on the value of pre-Service 
training in the Sea Cadet Corps, Junior or Senior Training Corps 
or Army Cadet Force and the Air Training Corps. Several 
follow-up studies in the Royal Navy showed that such recruits, 
together with members of Scouts and Boys’ Brigade—who con- 
stituted a quarter to a third of all intakes—were superior as seamen, 
in Coastal Forces, as radio and air mechanics and air fitters, and as - 
telegraphists; but it was doubtful how far this was due to their 
Possessing better g and education. Some 4,500 recruits in H.M.S. 
Royal Arthur were studied in 1944, including nearly a thousand 
Sea Cadets and a thousand A.T.C. members, and smaller groups 
from other organisations. The following rank order of average 
ability was obtained on T2 and on Test ЗЬ (Mathematics) and 
educational level: 

WEC апа SiG: 

Scouts and Sea Scouts. 

A.T.C. 

Members of no organisation. 

Sea Cadets. 

Boys’ Brigade and Church Lads’ Brigade. 
Army Cadets. 

It should be remembered that the best A.T.C. members were 
likely to join the R.A.F. and the best Army Cadets the Army, also 
that the general quality of entries was high at this time. In 1942-3 
Sea Cadets would certainly have been superior to average intakes. 
Tabulations were made of the number of recruits in each group 
recommended by P.S.O.s as suitable for the main categories— 
officer candidates, mechanics, writers and coders, communications, 
Seamen and stokers, cooks and stewards, etc. It was found that 
Sea Cadets provided rather more trainees for the most active 
branches—namely, officer candidates and seamen, though fewer 
for the specialist branches than did non-members, even when 
allowance was made for lower intelligence and education. The 
A.T.C., when ability was held constant, provided an excess only 
in the communications branches. Ex-Scouts yielded at least as 
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A he 
i officer candidates and specialists as any oft А 

p tbe Length of membership of the Mew 
qur was found to have a favourable, though only a very be a 
effect on the yield of high-grade material. The pone e 
was that pre-Service organisation membership as such a en inthe 
any effect, except for morse and mathematical training [ew 

A.T.C. The good Showing of members is almost entire Ah NEC. 

able to the intellectual, educational and other qualities of tho 

join the organisations. 0 

А Similar ок: were obtained in the Army when e. 
members of pre-Service organisations, and 5,500 ere EN. 

were studied. On test results, education, and ratings by P.5.O. 


:eations Was 
officer quality and leadership, thé rank order of organe 
Junior Training Corps, A.T.C., Army Cadet Force, nonen he best 
and Sea Cadets. The latter now come at the bottom since t 
cadets natural! 


mbers 
y prefer the Navy. While 121 per cent. of E i 
and only 24 per cent, of non-members were earmarked ee higher 
officers, the difference was/almost wholly attributable ү ble à 
education and intelligence of the former—as shown by Ta 
TABLE XX 


FFICER 
L—ANALYSIS ОР VARIANCE IN POTENTIAL o 
RECOMMENDATIONS ATTRIBUTABLE TO SEVERAL FAC 


е5 
Sum of 5044" 


per сй 
- 19°88 
Differences in educational standard ale ws E rod 
Additional variance dueto membership of a pre-Service organisa- 0.98 
tion ze ns 9: 74 А 2. tU 3 
Additional variance between differént organisations 76:03 
Individual differences (residual) .. m e 


+ se 
a р s of the 
Follow-up to later Stages in the naval or military career: 
recruits would be desir 


: In 2 
able, but was not found puce 
smaller, but carefully controlled, experiment, , fifty- vficien 
Cadets who had obtained the War Certificate A for WE 
Were matched with non-members of equivalent educa 


NAE former 
intelligence, and were subjected to special training. Ыы i accom 
group 83 per cent., of the latter 50 per cent., were able the 
plish th 


Pin p instead 0 
training in four weeks ins hich ws 


е. а Comparatively y 
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Three months later over half the men had been before W.O.S.B.s, 
and the superiority of the Certificate holders was much less marked 
as shown by the correlations in Table XXII. On the other hand, 
the weighted selection test scores used for picking potential officers 
(the O.R.1 Index, cf. p. 46) correlated almost as well with 
W.O.S.B. as with primary training results. 


TABLE XXII.—RzLarIVE PREDICTIVE VALUE OF PRE-SERVICE TRAINING AND OF 
SELECTION "Trsrs 


Correlations with 
Primary WOSB Pass 


Training or Fail 

Possession of Certificate A v. no pre-Service 
experience EN ^H bd Ре г. +747 +24 
Selection test battery (O.R.1) .. - ^^ +`38 . +783 


Effects of Physical Condition T 


Some 250 Army recruits with very poor physique who took a 
two months’ course at Army physical. development centres were 
tested before and after. (Results on the Matrices test alone were 
available for 648 cases.) The gain in scores to be expected on 
re-testing was known from an earlier investigation into the relia- 
bility of S.P. tests. In every test there was a greater gain than mere 
practice effect, though it was often small and. statistically non- 
Significant. The results are expressed on a common percentage 
scale in Table XXIII. The increase in Agility would be expected 


TABLE XXIII.—PERCENTAGE GAIN IN TEST SCORES ATTRIBUTABLE TO 
ATTENDANCE AT P.D.C.s 


Per cent. gain above Probability of gain 


ел practice effect being due to chance 
ЦОС АВЕ 82 DEN 65 <`001 
atrices .. as zs » 
25 Verbal .. fe s 2: » 
8 Assembly sont EX 22 «05 5-01 
2 Arithmetic 06 2:05 
2 Mechanical 0'5 » 


and conforms with an improvement in medical category, height 
and weight. Attached to the centres are picked educational ser- 
Беапіѕ who organise numerous "brains trust" periods, spelling 
bees, and the like. This may account for the rise on the Verbal test. 
The increase on Matrices is less readily explicable, but suggests 


Y 
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that the test involves some kind of mental alertness ke E 
physical alertness, in addition to g. Note, however, that, al Я n f 
highly significant, it only amounts on the average to 13 poin 
core. 
: Several enquiries were made in the A.T.S. into the c 
menstruation. The day within the menstrual cycle on whi 
battery of eight selection tests was taken for a second tine Ta 
ascertained by a medical officer from 1,335 auxiliaries, all of e. 
claimed a normal or twenty-eight-day cycle (some 270 o E 
admitting short, long or variable cycles were excluded). They w 
classified into four “phase” groups: tob 
(i) From four days before to four days after the onse 
menstruation, 

(ii) From fifth to tenth day. 

(iii) Ovulation phase—eleventh to eighteenth day. 

(iv) From nineteenth to twenty-fourth day. ould 
At the original testing the menstrual days were unknown and С an 
be assumed to be randomly distributed. Comparisons of teshi E. 
re-test Matrices scores yielded the analysis of variance pel. 
Table XXIV. There appeared to be slight differences be 


SE 
TABLE XXIV.—ANatysts ОР VARIANCE Dur то MzwsrRUAL PHA 

Sums of Squares P. 

per cent. _ 

Total Variance on Re-testing 5% WI | = 
Accounted for by regression of re-test on test 62:328 >'05 
Accounted for by difference between phases 7066 
Accounted for b 


Г 001 
in the same phase LE Gs 1:056 p 
Observed and expected re-test scores on some days, but they 1 


‹ : i strua 
irregular and were not associated with any particular a lysis 
phase. Phase as such had 


у differences between days 


е В И ven. 
1n DO case was it significant, € 
number of cases, 


state 
investigation, 1,000 auxiliaries were asked Т сё; 
both at test and re-test if they felt unable to do themselves a were 
and if so on what grounds. The most common complaint 
colds 10-4 per cen: 


iIblains 
t., Menstruation 3-45 per cent., and chi 
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2-1 per cent. In no case did the scores of those claiming unfitness 
on either occasion show significant drops. Indeed, the most con- 
sistent difference was a rise in the practical Mec test scores (S.P. 
24) among those undergoing menstruation. 

4 Among some 3,000 auxiliaries enquiries were made at a medical 
interview and 24-4 per cent. reported some pain during their 
periods. A strong association was discovered between this incidence 
of pain and type of A.T.S. or civilian employment. It was signifi- 
cantly higher among women in strenuous or outdoor, than in 
sedentary or indoor, jobs. No significant relationship to educa- 
tional standard was observed, and though pain occurred slightly 
more often among women of below average intelligence on 
Matrices, this was probably due to the association both of pain and 
of low intelligence with low medical category. 


CHAPTER XII 


GENERAL SURVEY OF THE VALUE OF VOCATIONAL 
TESTS 


Abstract.—An outline of the main results of pre-war зеи 
indicates that tests of general intelligence, of mechanical, spati E 
and other abilities, are of considerable value in assessing о 
tional suitability. Mechanical-spatial tests are certainly superar а 
verbal and educational ones in the selection of adolescents р 
engineering training, though giving less consistent results wi 


: m ituation. 
adults. Paper-and-pencil group tests of the miniature situa 


type and diagnostic tests show promise in a variety of educational 
and vocational fields. Though many psychologists consider ue 
such general tests are less appropriate than specialised pd 
tests for particular jobs, the evidence is inconclusive, and thec 7 ТЕ 
depends largely on whether the tests are to be used primarily 
classification and guidance or for selection. : the 
Very extensive follow-up investigations were carried out 1n 
Services. These demonstrated first the value of a standard beds 
of group tests in a large variety of jobs, and secondly, the prs 
small extent to which more specialised tests helped in differ en dé 
ing between different jobs. Mathematical and verbal tests Et. 
to surpass mechanical and spatial tests even in mechanical Zl 
` practical occupations. This tendency persisted when aped not 
as contrasted with training, criteria were studied, but would 23 
necessarily be true under industrial—as distinct from Servie 
conditions. 
oe 


5 t 
So great is the volume of investigations into the value of Wr hat 
types of tests for predicting educational or vocational success, 
We can attempt to give here only a very general summary " ces: 
literature before turning to the main results achieved in d Tos 
The usefulness of group intelligence tests of the ordinary neds 
type in relation to school and university work is well establis 


*A ful fi ex req the Uni ity level is give? y 
use! account tigati t rsií 
Eysenck (1947Ь). — ^ vestigations at the Unive: 


204 


THE VALUE OF VOCATIONAL TESTS ' 205 


but it is generally considered that they have little bearing on voca- 
tions other than those which involve manipulation of words and 
numbers. Thus they give moderate correlations with proficiency 
among clerical workers, but have been shown in various studies to 
be unrelated to success among business executives, telegraphists, 
compositors, motormen, toolmakers, certain types of assembly 
workers, packers, etc. In jobs of a routine mechanical kind high 
intelligence and education may sometimes even be disadvantage- 
ous, leading to dissatisfaction and increased labour turnover. Not 
all these studies will bear close scrutiny, however; the tests used 
Were sometimes inappropriate, the criteria far from reliable, and 
often no attention was paid to the selectivity of the groups con- 
cerned. Other investigations such as those with Army Alpha and 
Cattell’s Scale III (1934), have shown very considerable differ- 
entiation in average intelligence between high- and low-grade 
occupations. Though there is much overlapping, for example, some 
Machine operators being more intelligent than some school- 
teachers, yet it must follow that intelligence has a bearing on 
Success as a teacher. But as such groups as teachers are usually 
highly selected in respect of some factor like education, which is 
itself highly correlated with intelligence, any correlations between 
teaching ability and intelligence are thereby greatly reduced. 
Actually there is quite а lot of evidence of positive correlation 
between intelligence as measured by suitable tests, or educational 
standard shown by college grades, and success in professional and 
administrative jobs. Jenkins (1947) summarises studies of “leaders” 
in industry and other fields. The American Army General Classifi- 
cation Test gave moderate correlations with the achievement of 
commissions by officer candidates, though none at all with assess- 
ments of efficiency in battle. Barr (1946) and his collaborators 
found that when teaching efficiency is objectively measured by the 
Progress made by the pupils, intelligence ‘is one of the most pre- 
dictive factors. In this country Heim (1947) has obtained promising 
results with her high-level A.H.5 test among industrial executives 
and B.B.C. engineers. Farmer and Chambers (1936) applied Group 
Test 33—the N.I.LP. verbal intelligence test, the Cube Construc- 
tion performance test, and various mechanical and hand-eye co- 
ordination tests to numerous groups of engineering workers. They 
obtained correlations up to about -4 between the intelligence tests 
and proficiency in the more highly skilled, though not in the 
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simpler, jobs. In Holliday's (1943) follow-up of pe 
apprentices, Group Test 33 correlated as highly as his spatia алу 
mechanical tests with instructors’ assessments, though less m 

among trade apprentices. He suggests, however, that boys who 
do better on intelligence than on mechanical tests impress their 


assessors as being “bright,” but do not make as good craftsmen. 


Both he and Farmer provide some evidence that tests are more 
predictive of proficiency after several years than they are of success 
in the early stages of training. И 1 
Nothing appears to have been published on the vocational Me 
of non-verbal g tests. They are certainly less useful than verba 
tests in relation to primary school work, but seem to be superior a 
them in predicting mathematical and scientific achievement h 
advanced secondary and university levels. Performance tests | 
as Minnesota, Kent-Shakow, Moorrees and Oakley Formboar " 
Cube Construction and O'Connor's Wiggly Blocks test are viden 
used in industry and in vocational guidance. There is, -— 
remarkably little evidence of their validity. In Rodger's (19* A 
study of several trades taught to Borstal youths, Cube Constr ir 
was the best test. Psychologists have largely employed such A 
for the qualitative indications they yield of methods of work, 
these, too, are in need of validation. ve 
Mechanical assembly tests such as Stenquist's appear to T т 
been used only оп а small scale because of the time requiren i 
individual testing. No follow-up evidence is given on the ri in 
American Army recruits to whom an assembly test was applie 
1917-18. Fair results among boys and apprentices were foun 37) 
the Minnesota investigators (Paterson e al., (1930), Rodger (19 a 
Earle and Macrae (1929), Farmer and Chambers (1936) and оф 
and high validities have been claimed among certain types of me al 
Workers and cotton-mill machine fixers. The Purdue Mechant© 


5 c 
Assembly Test (Tiffin, 1946) is a new and improved version wh 


à uc- 
has been validated among machinists, Group tests based on ded 
tions about workin: 


g mechanical models have been devised Бой 
Cox and Vincent, These appear to be effective substitute ait 
assembly tests among adolescents, according to Hunt and 8 100, 
(1945), Holliday (1943), and Shuttleworth (1942), but they: 
have seldom been tried out on adult workers. :ed voca- 
‚ Group tests of spatial ability or k which have been applied V of 
tionally include the N.LLP. Form Relations and Ме 
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Designs Tests, the Minnesota Paper Formboard, Thurstone’s 
space tests, Group Test 80, Squares and Figure Construction (cf. 
pp. 236-239). Many of the investigators just cited provide evidence 
of their usefulness in selection for technical education and trade 
apprenticeship. Certainly, along with models or assembly tests, 
they are superior to scholastic examinations, and probably to 
verbal intelligence tests, in picking boys for "practical" careers, 
though how far they work at as early an age as 11-12 is more doubt- 
ful. Holzinger and Swineford (1946) similarly report that they 
correlate with shop work and mechanical drawing, but not with 
geometry. 

Several mechanical tests are based on pictures of machines, tools 
or mechanical situations. Cox’s (1928) Diagrams, Explanations and 
Completion tests set out to measure comprehension or mechanical 
understanding rather than experience, while O’Rourke’s so-called 
Mechanical Aptitude test (which also includes verbal items) is 
clearly a test of mechanical information. Stenquist’s test with the 
same name and Bennett’s Mechanical Comprehension test "аге 
Intermediate, the latter also containing problems from statics and 
dynamics, heat, light and sound. There is considerable confusion 
as to what these tests measure, and their titles are often misleading. 
However, as pointed out above (p. 170), the distinction between 
aptitude and attainment is of little practical importance. Cox’s 
tests are considered useful by the Birmingham investigators; 
O'Rourke's was widely used by the Tennessee Valley Authority, 
and the Bennett test was successful among machine-tool operators. 
But there is little further evidence of their validity from civilian 
Investigations. Cunningham (1943), describing the war-time appli- 
cations of psychological methods in Australia, mentions a study 
of twenty tests for selecting fitter and turner trainees in munitions 
industries. The most valuable were intelligence, technical infor- 
mation, and mathematics tests, Cox’s Diagrams, a paper formboard 
and other Ё tests. 

All the above tests are general ones—tests of general intelligence, 
Spatial, mechanical or practical abilities. Most psychologists 
assume that in selection for particular jobs, tests must be more 
Specifically designed on the basis of a careful job analysis. Drake 
(1940), for instance, as a result of extensive experience in a large 
Industrial firm, regards the application of paper-and-pencil tests 
as a waste of time and money. He himself developed dexterity, 
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and no clear trends are discernible in other fields. In motor driving, 
in telegraphy and in music the most successful tests appear to be 
of the work-sample or miniature situation type. Analytic tests such 
as Seashore’s tests of musical talent have been used, but generally 
show poor validities. According to Johnson (1946), a simple com- 
bination of relevant biographical items is as effective in picking 
Motor drivers prone to accidents as are any of the proposed 
apparatus tests of co-ordination, reaction time and the like. Useful 
tests of visual perception for range- and height-finders were 
developed by American psychologists during the war, but the 
application of tests of dark adaptation in eliminating men who 
could not see effectively at night appears to have been a complete 
failure. 

Tests of medical, scientific, legal and nursing aptitudes have 
usually been validated against examinations in these subjects, not 
against skill on the job. Thus both in these fields, and in the selec- 
tion of clerical workers, paper-and-pencil tests either of general or 
of more specialised abilities are found quite useful. For example, 
the efficiency of administrative civil servants was successfully pre- 
dicted by a battery consisting of tests of general intelligence, 
knowledge of current events, knowledge of the Civil Service, inter- 
Pretation of charts and tables, and judgment of administrative 
situations (Mandell and Adkins, 1946). A promising line has been 
opened up by recent developments in diagnostic testing. We know 
that different types of mental patient can to some extent be dis- 
tinguished by projection, sorting, abstraction and other tests; and 
Thurstone’s (1944) factorial analysis of perceptual tests has 
revealed “qualities of mind" over and above £, 2 and k, which 
appear to discriminate such groups as administrators and student 
leaders. Munroe (1945) has developed a reasonably objective 
method of scoring the Rorschach inkblots* to yield a measure of 
emotional instability which correlates well with failure at a uni- 
Versity among students of good ability. The same test is claimed to 
differentiate mechanics from youths more suited to other types of 
job (Piotrowski et al., 1944). Thus, there is reason to hope that the 
future will see the isolation and measurement of some of the special 
qualities possessed by successful officers and leaders of men, by 


M Here the Rorschach test was given to groups, but as nearly as possible in 
the original form. The further modification where multiple-choice responses 
are provided, so that scoring is wholly objective, is much less successful (but 
cf. рр. 256). 
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executives and administrators, by teachers, research workers, 
salesmen, interviewers, and so forth. 

At the moment, however, while many tests have been tried out 
in these occupations, there are none that can be confidently recom- 
mended. Indeed, it is fair to conclude that successful guidance ог 
selection of such persons has been based chiefly on the interview 
and background questionnaire, supplemented by tests of general 
intelligence, and personality and interest inventories. 


Follow-up of Tests in the Forces 


During 1942-6 there were seventy-six follow-up investigations 
in the Navy alone, covering over 31,000 recruits, in the course of 
which some criterion of proficiency was correlated usually with six 
or more selection tests, often with other data such as source or 
mode of entry, age, education and civilian occupation; sometimes 
also with numerous items such as interests or leadership experience 
taken from the recruits’ biographical questionnaires. The occupa- 
tions included seamen, gunnery and torpedo rates, radar and asdic 
operators, stokers, stewards, cooks, photographers, safety equip- 
ment ratings, cinema projectionists, supply assistants, writers, 
R.N.V.R. officer cadets, artificer apprentices, electrical artificers, 
electrical and radio mechanics, motor, engine-room and ordnance 
mechanics, wiremen, air fitters and mechanics, telegraphists, signal- 
men, telegraphist air gunners, Fleet Air Arm pilots and observers, 
naval instructors and W.R.N.S. personnel selection staff. The 
groups ranged in size from about 30 to 3,000, but the median size 
was 300. In all these studies the tests were given, or other data 
collected, on entry and the recruits’ success or failure traced later, 
usually at the end of training. They do not include validatory trials 
of new tests. 

Soon after the introduction of regular selection procedure (the 
General Service scheme) into the Army, 2,500 recruits in some 

twenty representative jobs were followed up, and the test battery 
was modified in the light of the findings. Numerous subsequent 
investigations were made of jobs where specific information was 
needed, e.g. in order to set appropriate test standards. In the main 
A.T.S. follow-up, some 6,000 auxiliaries were studied in the 
twenty-seven commonest jobs, the median size of sample being 
106. Validation results in the R.A.F. are given in Chapter XVI. 

The outstanding facts revealed by this work were the value of a 
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multiple correlation of -47 rises to -65 when this selectivity is | 
allowed for (in so far as statistical techniques are capable of making 
such allowance; cf. Chapter VII). The size of these average or 
median coefficients is distinctly greater than would have been 
anticipated from civilian experience, though obviously still too low 
for accurate selection in the absence of other information. But the 
Tables show that the correlations varied considerably in different 
investigations. It was noted that the highest coefficients were 
usually obtained in jobs involving lengthy training, including a fair 
amount of theoretical work, where the final assessments of pro- 
ficiency were based on thorough examinations, and where no 
Scheme of selection run by psychologists was already in operation*. 
The lowest coefficients occurred in jobs where the work is highly 
specialised (such as radio mechanic), or where previous trade 
experience is of paramount importance, also in jobs (such as sea- 
man) where assessments of efficiency are based more on personality 
qualities, e.g. dependability,than on any definite skill or knowledge. 


Validities in Different Types of Jobs 

We would naturally have expected the verbal and educational 
tests to show relatively low validities in mechanical and practical 
occupations, and the mechanical-spatial tests to be of value only 
among mechanics. But such differentiation was conspicuously 
small. In some jobs all the tests might achieve high coefficients, in 
other branches all low, but the relative validities of the different 
tests were remarkably uniform. Tables XXVII and XXVIII com- 
pare the mean validities in three main types of work in the Navy 
and the Army, and while they show that the mechanical and spatial 
tests were, indeed, relatively less useful than verbal-numerical 
ones in clerical and communications jobs, yet they were just about 
as useful among seamen, infantry, officers and gunnery ratings as 
In the Navy the Mathematics test obtained the highest validity 

in most branches and the Squares test the lowest. Often, indeed, 

the Mathematics test gave better coefficients than T2—the sum 


of four tests. The Bennett Mechanical test ipe 3 pe Meo 
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board large American ships, the highest average validity was 
obtained by an arithmetic test, followed by a verbal intelligence and 
mechanical knowledge and aptitude group tests. When the U.S. 
Army revised its general classification test for all recruits, it chose 
a battery consisting mainly of arithmetic and reading tests, pre- 
sumably because such tests had proved most valid for Service 
purposes. ` 

The conclusion followed, in this country, that almost all 

branches of the Forces wanted the same type of man—one with 
good education (especially in mathematics) as well as good intelli- 
gence. Thus the main use of tests was for apportioning the available 
supplies of high-quality men among the different branches accord- 
ing to their needs. Differentiation between jobs was based more on 
interests and interview judgments. Several possible explanations 
ofthis result may be proposed: 

(1) It is shown below that tests like Clerical and Arithmetic are 
more reliable than mechanical-spatial or other vocational 
tests. This alone would help to raise their validities. 

(2) Although such tests do depend on education or the v : ed 
factor, they actually have as high g-saturations as non-verbal 
tests such as Matrices and will, therefore, correlate well with 
any job proficiency involving g. 

(3) It is possible that success at these tests depends on certain 
personality or temperamental qualities such as stability and 
persistence, in addition to g and education, which are 
relevant to vocational success. 

(4) The extreme heterogeneity of recruits in g and education is 
certainly an important factor. Applicants for any one type of 
civilian employment would be unlikely to range from uni- 
versity graduates to mental defectives. Most people do not 
apply for jobs which are quite outwith their capacity. In 
a more homogeneous group there would be greater scope for 
specialised tests. 

(5) Jobs in the Forces tend to be more variegated than most 
civilian employments. Both at the semi-skilled level (e.g. 
seamen, infantry) and at the highly-skilled (e.g. radio 
mechanics), the recruit is not engaged in one specific type 
of work. Even when he is mainly concerned with particular 
equipment and machines he needs a good deal of adapta- 
bility to be able to service old, or freshly introduced, models. 
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Assessments of his proficiency are likely to refer to numerous 
different duties, hence general tests may be more appro- 
priate than specialised ones. 

(6) American Service psychologists have presented evidence 
from several naval jobs to show that mechanical and other 
specialised tests are more valid than verbal educational tests 
when the criteria consist of objective measures of proficiency, 
but less valid when théy consist of instructors' gradings. 
The implication is that such gradings are based more on 
written work and on general impressions of brightness than 
on practical skill. While it is true that the criteria employed 
in this Country consisted largely of subjective gradings 
(Types C and D rather than Type A), the superiority of 


verbal tests certainly did not occur only when these gradings 
involved written work. 


(7) Admitted! 
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Taste XXIX.—T2 S.G.s or SIGNALLERS OBTAINING HIGH or Low 
ASSESSMENTS FOR OPERATIONAL EFFICIENCY 


T2 Gradi 
Assessment шш: Total 
A B е DorE 
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Passable . ч $ М 1 1 12 2 22 
Below standard . s s 1 1 10 T 19 
45 | т 68 13 | 200 


but the spatial Squares test does improve when compared with an 
operational criterion. In Coastal Forces, 185 motor launch and 
motor gunboat ratings were assessed on several qualities by their 
officers (by the conference method, cf. p. 110). Correlations with 
"fighting qualities” were around zero, but with “dependability” 
the coefficients for Abstraction, Mathematics and Squares were ‘25, 
‘17 and -30 respectively. Thus in this instance, the spatial test did 
Show up better than v : ed. tests. 

Some 7,000 men in all Arms of the British Liberation Army 
Were assessed on the questionnaire reproduced on p. 109 at the 
conclusion of hostilities with Germany, and the test scores which 
certain groups had obtained on recruitment were traced. Among 
260 infantry the Arithmetic test gave a correlation of -263, and all 
the other tests smaller coefficients. While this figure is low, the 
reliability of the assessments may be largely responsible; for the 
correlation of the operational assessments with efficiency gradings 
given shortly before the invasion was only :297. 

In the A.T.S. over 600 trainees for anti-aircraft duties were 
followed up, and later some 1,300 were assessed for efficiency after 
Serving two or more years. The average validity coefficients 
(corrected for selectivity) are shown in Table XXX. Most of the 
tests, and the multiple coefficient, drop considerably in validity at 
the operational stage. But Spelling achieves a much higher 
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coefficient, and the Clerical test is relatively more valid than at the 
training stage. 

Another approach was to contrast the test scores of men pro- 
moted to higher naval rates with lower rates, on the assumption 
that the Navy would select the most efficient rather than the most 
educable for advancement. The T2 differences among gunnery 
and torpedo rates are portrayed in Fig. 4, and these, together with 
the differences on other tests are expressed as correlation ratios 
(which are similar to correlation coefficients) in Table XXXI. It 
happens that the absolute correlations are much higher in the 
torpedo than the gunnery branch, but the relative order of valid- 


ities is the same in both, and the mathematics test greatly surpasses 
the others, including T2, 


TABLE XXXI.—ConnzLATION Ratios ron NAvAL SELECTION TESTS 
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should, as far as possible, have had equal amounts of 
previous practice with tests. 

Such group tests chiefly indicate the all-round proficiency, 
adaptability, and acceptability to employers, and not any 
particular skills. In other words they show the general level 
of occupation for which a candidate is fitted, and do not 
readily differentiate between different occupations at the 
same level. At the same time it is easier to determine this 
level and to add supplementary tests where needed, than to 
follow the principle of preparing an elaborate battery of 
specialised tests for each specific occupation. When full 
weight is given to interests, previous experience, etc., as well 
as to this general level, it may often be found that there is not 


very much left over which requires testing. 
\ 


CHAPTER XIII 
VERBAL INTELLIGENCE AND EDUCATIONAL TESTS 


Abstract.—Ordinary verbal intelligence tests are shown to be less 
appropriate for adults (particularly those of below average ability) 
than tests of the abstraction type. An account is given of the merits 
and defects of clerical tests, oral directions, vocabulary, reading 
comprehension, verbal fluency, dictation and spelling, arithmetic 
and mathematics, and dial and instrument reading tests. Examples 
are given of the tests found most useful in the Services. Selected 
follow-up investigations illustrate the value of such tests among 
Army lorry drivers, naval asdic operators and Writers, and R.N.V.R. 
officer cadets. 
== n 


Most of the intelligence tests in common civilian use consist of 
batteries of sub-tests, each sub 
response items such as vocabulary, analogies, classification, come 
pletion, reasoning problems, etc., and having a strict time limit of, 
Say, two to ten minutes, Sometimes, in the so-called omnibus test, 


items of all types are mixed up, and are explained and practised 
before the test Proper begins. 


Were quite widely applied in the Services, 
-*- Group Test 33, and the first parts of the 


out sample questions. This added 
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sometimes to be seriously affected. Another drawback was that the 
conventional test items and their choice-responses often struck 
senior officers as artificial and quite unrelated to the soldier’s or 
sailor’s jobs. It was very necessary in the early days of personnel 
selection to win their good will, and it was not easy to answer the 
criticism which they sometimes raised, namely, that tests of this 
type are chiefly constructed by university psychologists for testing 
school-children, and are inappropriate for men and women who 
have long grown unaccustomed to paper-and-pencil work and to 
manipulating verbal symbols. Our results on changes in ability 

, between 14-18 years (Chapter XI) lend some support to this view. 
Factorial analyses showed that V.I.T. and similar tests are good 
measures of g, but that they are also dependent on verbal-educa- 
tional ability. Thus it is misleading to regard them purely as 
measures of intelligence. Finally, the fact that they mostly have to 
be done at maximum speed is objectionable, for, as Cattell (1943) 
shows, the performance of adults is more seriously affected by age 
in speeded tasks than in tests with a generous time limit. A novel 
form of verbal test, known as abstraction, appeared to provide the 
answer to these difficulties. 


Abstraction Tests ‘ 
Early in the war a pair of shott tests of vocabulary and abstrac- 
tion was published in America by Shipley (1940). They were 
intended for mental or brain-injury patients whose reasoning 
Capacities often deteriorate more seriously than does their “‘crystal- 
lised" intelligence, shown by knowledge of words. The vocabulary 
test (S.P.5) was occasionally used in the Forces, but was replaced 
by the more suitable tests described below. Adaptations of the 
second test, however, were very widely applied in the Navy (Test 1 
and В.С. Test Р) and by War Office selection boards (S.P. 45). An 
abstraction test contains items like the following*: 
Wherever you see an asterisk, оле letter or number is missing. 
Write in the missing letters or numbers. 
DA Mig GN OMe? ОНА 
5 15 25 35 45 ae 
big little ^ rough smooth ^ hard **** 
OZ. Ib. stone cwt. 


* No items quoted here are taken directly from tests which are in use, but 
are similar to them. The instructions are usually abbreviated. 


KER 
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analogies, number series, cipher коре 
and so оп. (Items analyses of several tests showed the рае 
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The reliability coefficients of tests with only twenty items an к. 
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Д . ique 
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Clerical and Instructions Tests 
A clerical test was Constructed in 1942 for Army and ATS. 
clerks, which resemble 


g printed information —in rapid тое, 
Instead of in separate sub-tests. This was proved to be a good tes 


е, as also was the In 


ent letters, write a X in column (1). If they 
ame letter, leave a blank. 
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2. Count up the total number of vowels (A, E, I, O or U) in 
both words and enter it in column (2). 
4, Classify the longer of the two words under the following 
headings: 
C. Names of countries. 
L. Living things, animals or plants. 
N. Non-living objects. 
If the first word is longer, write C or L or N in column (3). 
If the second word is longer, write C, L or N in column (4). 
Leave the other column blank. 
5. In column (5) write a X if the two words belong to the same 
class. Leave a blank if they are different. 
Go through the three examples below, and then work 
across each line*, answering as many of the thirty lines as 
you can in ten minutes. 


p 


(1) (2) (3) (4) (5) 


France England X 4 c x 
Motorcar Wales x 5 N 
Elephant Emerald 6 L 


1 Chair Bed. 
2. Rose Russia 
Etc. А 


It is difficult to conceive why such а test should be successful, 
for it appears at first sight to suffer from most of the defects of 
verbal tests listed above. Yet factorial analysis proves that it is as 
good a test of general intelligence as abstraction and better than 
Matrices. It is not greatly affected by education, but does involve 
a purely clerical factor to a small extent. It seems to depend on: 

(a) Comprehending the rather elaborate instructions. For this 

reason the test, as used in the Army and A.T.S., was re- 
christened. 

(b) Learning the instructions rapidly, in order to avoid going 

back to the beginning and consulting them frequently. 

(с) Good vocabulary and ability to abstract the meanings of the 

words under the given headings. 

(d) Mental flexibility and speed, or ability to change over 

quickly from one operation to the next. 


Se is desirable to make the operations inter-connect as closely as possible, 
Otherwise testees will be tempted to work down each column in turn. 
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(e) Thinking out efficient methods of work, e.g. doing two or 
three operations simultaneously. А UON. 
(f) A temperamental factor of drive, for maintaining 
necessary concentration throughout. 
Although (d) and (f) would not normally be regarded as com- 
ponents of general intelligence, they may enter into many: occu- 
pations, and so enhance the vocational value of the test*. 


Directions Tests 
The ordinary written di 


rections item does not appear appro- 
priate for adults of sub 


-average intelligence, for example: 

If the fourth word in this sentence is longer than the fifth 
word, write down the second letter of the last word; if not, write 
down the last letter of the third word. 

But an oral directions test where complex instructions are read 
out by the tester was used successfully in the Army Alpha battery 
and, in this country, gave useful predictions in several jobs such as 
asdic operator and motor/driver. Men who are unaccustomed to 
reading and writing are much less handicapped than with a printed 

псу in the Forces depends so largely on 


prompt reaction to, ora] directions that this 
test might be regarded as a work 


t With dictation tests, gramophone 
Y Poorer scores than oral testing (even 
€ by a B.B.C. announcer). 


Vocabulary and Reading Tests 
explicitly involy 
for Several jobs, including cl 


1 its score distributions. 10 per cent. 
Population can fy й istributions. Nearly 10 p 


f » and almost as many 
a -shaped distribution ЕС Scores, unless the time limit is reduced. Such 


for most selection urposes, but is 
o the statistician. Purposes; 
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Navy added considerably to the standard tests. This test contains 
six passages and thirty choice-response questions requiring the 
abstraction of information from the passages. A much simpler test 
(S.P.22), in two parallel forms, was devised for semi-illiterate 
Army recruits and standardised on children to yield reading ages. 
This was used in an experiment on the effects of basic education 
courses, where it was shown that the improvement among illiterates 
sent on a six-weeks’ course by their own units was small and 
irregular; but that among men selected by Р.5.0.5 and psychia- 
trists as likely to benefit from the courses, the improvement was 
much larger. Two tests of attainment in English, chiefly involving 
reading comprehension and vocabulary, were included in the 
R.A.F. air crew batteries and one of these (Gen-B) was found to 
give promising predictions in all categories except pilot. 

Forindividual testing, e.g. of naval neuropsychiatric patients, the 
vocabulary, comprehension and similarities tests of the Wechsler- 
Bellevue scale (S.P.6) were used (cf. Trist, 1941). With Army and 
A.T.S. officer candidates, the Mill Hill vocabulary test (Raven 
and Walshaw, 1944) was satisfactory. But when older, serving 
officers came up for re-allocation both this and other verbal tests 
often seemed to arouse a good deal of anxiety, and a more suitable 
test was devised consisting of fifteen words (such as FORM and вїт) 
for each of which four different meanings are to be given in writing 
in fifteen minutes. This test, the abbreviated Wechsler, and 
Shipley's vocabulary, all have much the same factor content, 
showing fairly high dependence on g, but also having large verbal- 
educational loadings. 

More reliable than any of these was the Army's Verbal test 
(S.P.25), based on synonyms, homonyms, and rhymes. Specimen 
items are shown below: 

Write down on the dotted line a word which means nearly 
the same as the word on the left, and which starts with the two 
ют three letters that follow: 

Example COMMENCE,..... BEGIN...... 
IS ЕНЕКТ уу. 5.9.9 ЖҮ CAR tes 5 

Write a third word on the dotted line which rhymes with the 
left-hand word and means nearly the same as the right-hand 
word: 

Example RANGE eese Ae ein СВапре............... ALTER 


+ Р.8.—8 


" 
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Write a third word on the dotted line which means nearly the 
same as the left-hand word, and, in a different sense, means 
nearly the same as the right-hand word: 

Example RELISH 


Like the abstraction test, this has the advantage of objective 
scoring without choice-responses. (Indeed, Shipley's original test 
contains an item of the third type just quoted.) 

The main R.A.F. battery, devised by Stephenson (cf. Chapter V) 
has a V or verbal section. This too, contains synonyms and 
opposites, reading comprehension, and questions on verbs, nouns, 
and adjectives, e.g.: 

Make adjectives (describing words) from the following nouns 

(naming words). The first has been done for you. 

Example LEAF Leafy 
l. HEAT 
2. AFFECTION 


Dictation and Spelling Tests“ 

Ordinary passages of prose are unsuitable for dictation, since so 
much time has to be spent on easy, non-discriminating words. 
Several parallel passages (S.P. 70—74) were devised for the Navy, 
each containing thirty-five words, almost all of which were fairly 
to very difficult. But these were barely long enough to be reliable, 
and, although satisfactory when given by carefully trained testers, 

cted to some extent by their enunciations 
periment an experienced А.Т.Ѕ. officer cave 


Successive for excessive 
In ; 


» and 
Is » has 
The. » though 
Gratified 55 gratifying 
Patrolling » patrol. 


A better form of test (S.P. 127-130) consists of twenty words, of 


" 
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a suitable range of difficulty, each incorporated in a sentence 
which helps to define the word and to determine its tense, part of 
speech, etc. The whole sentence is dictated, but only the critical 
word is written down and scored. For example: 

onions. I grow ONIONS on my allotment. 

SEIZE.  SIEZE hold of the rope and pull it. 

A speed of twenty seconds per word, including its sentence, was 
found adequate. Even in this test the influence of the tester was 
apparent. For example, five testers, dictating PLUMBLINE to com- 
parable groups of about 180 recruits, obtained 40, 50, 41, 34 and 51 
per cent. of right answers. Presumably the second and fifth testers 
pronounced the word much more intelligibly than the others, 
perhaps sounding the B. 

Objective spelling tests are the only way out of this difficulty. 
In the A.T.S. test (S.P.14), five incorrect versions and one correct 
version of each word were mixed up. A synonym at the beginning 
of each line helped the testees to identify the word. The right 


, Spellings had to be underlined. For example: 


RAPID qick kwick qwick quick quic cwik 
GRASP seize sieze sease seez size siese 

Admiralty psychologists adopted an intermediate form consisting 
of sentences in each of which one word was mis-spelt, the testee hav- 
ing to find this and write it out correctly (R.C. Test C). For example: 

He is a very qwick runner 
Sease hold of the rope and pull it. 

In one investigation, six varieties of dictation or spelling tests 
were given to the same recruits, and it was concluded that they all 
measure the same ability almost equally well. (This conclusion 
would probably not be true among school-children.) All have 
moderate g, but larger verbal-educational saturations. Straight 
dictation, however, probably involves least g, and the A.T.S. type 
of test may depend to a slight extent on.a distinct factor of ability 
at clerical work. The straight dictation was the least reliable, but 
the reliabilities of all other types of test were almost equally high. 
The recruits’ preferences were recorded, and again there were no 
outstanding differences. In another factorial study the sentence 
dictation and A.T.S. Spelling tests were shown to measure the same 
abilities as the Army Verbal test. In a representative male adult 
population ability to spell does not seem to be differentiated from 
general education in words. 
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Arithmetic and Mathematics Tests 
Both the Admiralty and War Office psychologists constructed 
tests in two parts, the first consisting of a few minutes of straight- 
forward addition, subtraction, multiplication and division sums, 


the second including thirty to forty brief questions for which 
eight to ten minutes was sufficient time. For example: 


How many pence in half-a-crown ? 
14 + 2} = 

Increase 50 by 10 per cent. 

sec À " 


Sz М 
COSECTAR о US D auohoues 
Scores on the first 


positively, skewed. In other words the lower half of the population 
is best differentiated by its skill in rote arithmetic, the upper half 


aborate arithmetical problems, and which is four 
Чу commented on the surprising 
езе tests and the fact that, even in relation to 

ten gave better predictions of proficiency 
» and among seamen, infantry, and officer 


vocational value of th 


mong high-grade groups 

and the two parts together 
Saturation, particularly of the 
elligent adults both learn and 


(electrical mechanics, 
among average recruits, The high g- 


Tuctors as never likely to make 
€st than on any other, this does 
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not mean that the instructors cannot tolerate the badly educated, 
but rather that most of these men are either innately dull, or 
maladjusted individuals who fail to settle down to naval life, just 
as they failed to settle to school life. Similarly, Lummis (1946) has 
shown that bad Army conduct records are associated more closely 
with low scores on arithmetic than on other S.P. tests, and that 
arithmetic is most seriously affected by irregular schooling and 
truancy. It was interesting to observe moderately high correlations 
even with assessments of officer quality and power of command 
among R.N.V.R. cadets. Thus when the top quarter of a group of 
550 assessed most highly was contrasted with the lowest quarter, 
it was found that 56 per cent. of the former and 30 per cent. of the 
latter were acquainted with Pythagoras’ theorem, 78 per cent. and 
58 per cent. knew the cube root of 64, and so on. 

. Ап alternative type of test (S.P.23) was preferred in the A.T.S. 
because the mathematics test was too difficult. (The average female 
adult cannot even manage decimals.) Worked sums were provided 
in which there were three mistakes which the testee had to correct. 
For example: 

\ 


СЕ 

Subtract 456 12 8 
269 5 10 

£195 6 8 


Such a test, with ten sums to be checked in ten minutes, is also 
very reliable, and measures nearly the same abilities as the others, 
but depends to a small extent on ability at clerical work. From the 
Vocational viewpoint, arithmetic was often less predictive than 
spelling among women. 

For its high-grade air crew recruits, the R.A.F. adopted several 
more difficult American tests, all of selective-response type. Mat-A 
tests algebra and trigonometry, Mat-B contains verbal problems, 
and Mat-D rote arithmetic. Mat-C requires testees to find 
approximate answers to such questions as: 

248-4 miles per hour for 20 hours and 15 minutes = (a) 513 
miles, (b) 50 miles, (c) 5,000 miles, (d) 500 miles, (е) 503 miles. 
Mat-F involves reading off the correct entries from numerical 
tables. The two latter tests bear a close resemblance to some of the 

Jobs of the navigator. , 
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Instrument Reading Tests к s 
Two other tests borrowed from the U.S. Army a pae 
d useful in the allocation of air crew categories. NIS 
25) f dials, as on an aeroplane panel, marked Altitu е, ling for 
Ratio. etc. There are fifty-seven choice-response oe T а 
the correct teading of appropriate dials. In aes i жы E 
Six dials have to be read and related jointly xh = = eae а 
aeroplane’s velocity, orientation, altitude, etc. a eto M 
of two dials—Artificial Horizon and Compass. e a et 
read and related to illustrations of aeroplanes in P а Mei 
item there are five planes, only one of which can be re 
ir of readings. m 
OM test (S.P.119) was devised for naval а А d 
The first part shows several pictures of bits of ru ег: Jae By 
dials, the testee having to read off the p "ge d 
pointers (with or without interpolation). The second p: 


: na very 
reading the co-ordinates of points on a rectangular, and o 
. simple polar, graph. 


Illustrative Investigations 


: їз тау 
Many of the above Statements and conclusions about tes 
appear somewhat dogmatic. It is im; 


TABLE XXXIL— Varmrry COEFFICIENTS OF TESTS FOR DRIVERS 


po 
Age. 5 * / 7 р К > ө i . z .31 
Matrices T z : i , $ Wo tes . b -586 
Bennett Test 2 ' - s - а 5 A = . Е 
ersonality inventory 1 


Previous driving 


-439 
ELE м. ®]; 511 
experience. ? : 3 й ‘ : -382 
Cycling experience | А я s У t 5 E 4 +486 
Oral directions test » 1 р i а к . . d 431 
Uestionnaire оп driving knowledge ` > Р А . E 423 
Interest blank Scored for drivers - 5 А : к җа £ -335 
Тоир choice reaction time . 5 è е 2 ` < "987 
quares, Test 4 , К i 5 2 и à « . : +266 
Interest blank Scored for mechanics у X 6 a . : +212 
udgment of distances д ; у 3 E . Е "200 
udgment of ellipses . б : E А P А . s 
Judgment ОЁ speed `| J - E 
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In another similar group of 306 recruits Chleusebairgue’s (1939) 
elaborate choice reaction time test gave a correlation of -385. The 
multiple correlation of the first four items on the list combined is 
669, and the figure does not rise to more than -68 ог -70 when any 
other test (except driving experience) is added. Note that the three 
analytic tests, borrowed from the N.LI.P. battery (Miles and 
Vincent, 1934) come at the bottom, and that oral directions is the 
best test after Bennett. The interest blanks and inventories are 
discussed in Chapter XV. 

Asdic Operators.—Several experiments were carried out in anti- 
Submarine training schools, on the basis of which an omnibus 
intelligence test (Selection Test A), mainly consisting of oral 
directions items, a gramophone test of sense of pitch (Doppler 
efféct), and a group choice reaction time test, were installed as a 
regular battery. It was also found that Heim's A.H.4 test, contain- 
ing verbal and non-verbal intelligence items, was a satisfactory , 
alternative to Selection Test A. Later an individual audiometric 
examination was added by the medical officers. One sample of 282 
trainees, selected in respect of the above battery and A.H.4, but not 
directly selected on T2 2, yielded the correlations with training 
results shown in Table XXXIII. 


TABLE XXXIII.—Tue VALUE or DIFFERENT INTELLIGENCE TESTS IN SELECTING 
Aspic OPERATORS 


Correlations sens T 
Asdic battery i Я z : $ E "343 *553 
A.H.4 *384 '572 


a d usd NEL Pa УТИ. -440 -597 


When statistical correction for selectivity was applied, all three 


tests or sets of tests showed similar, fairly high, validities. 
\ 


TABLE XXXIV.—Tue VALUE OF INTELLIGENCE AND OTHER TESTS IN SELECTING 
ASDIC OPERATORS 


Tests Correlations 
Intelligence, Selection Test A А Е à 4 F 417 
Group choice reaction time . , 4 1 А a *045 
Doppler Record . 3 i * $ " 5 А 179 
udiometric threshold . 7 i 3 5 P B 145 
Audiometric sense of pitch . = : 4 а 3 165 


In the course of time the reaction time apparatus, and its method 
Of application, became unreliable. Nearly four years after the 
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ini shown 
institution of the tests the correlations with training wi E 
пе were obtained ae ire end 
| wi be seen that the intelli es 
EE xm as before, and the PR Ee E 
slightly in selection. The audiometric tests, thoug 
not appear superior to this group auditory ay" ERES s 
Writers and Supply Assistants —Some 8 Мере: 
were followed up іп 1943, and their scores ay е ced oer 
correlated with training results, Four to six и shown in ТОЁ 
iven additional tests in 1944-5, with the results np. 
XXXV. (Tests 2 and 4 are omitted since they em [ис 
Coefficients.) The second set of figures for T2 pog Wi 
the first because of more efficient selection, an 


d 
Р have, ha 
tests attain rather higher coefficients than they would , 
they also been used for selection. 


ORK 
ICAL W 
TABLE XXXV. Tug VALUE or TESTS IN SELECTION FOR CLER 


Correlations in: 
Test _——— 


Abstraction | 3 
3a Arithmetic y 
3b Mathematics 
T2 


Т1 Dictation | C { 
Amny Instructions 


Group Test 25 (N.LI.P. Clerical) Ё 
96 U.s. Reading comprehension | 


\ the 

Nevertheless, it was found that either of the clerical Weg 
reading comprehension. would add appreciably to pre 

based only on the Standard battery, М 

V.R. Officer Cadets. Тә Was 

raining in Н.М 


› the result: 


5 are expressed in 
Correlations with T2 (int 
(in the second column) held 
ranged around “535, and the у; 
between -30 and -55, 
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TABLE XXXVI.—PanriAL CORRELATIONS SHOWING THE ADDITIONAL VALUE 
OF Various TESTS IN OFFICER SELECTION 


Partial Correlations : 


Test 


T2 Constant |1 + ЗЬ Constant 


96 Reading comprehension $ : š "20 
Mathematics examination. i . 17 
Ei unipyren аи шур. З ЕС + Sy) 
12 Army Clerical 

21 Army Instructions . Ё 
Group Test 25, N.LI.P. Clerical 


45 W.O.S.B. Abstraction . 


о CY ы с. 1. 2 тї 
Group Test 33, N.LI.P. intelligence . 13 
5 Shipley Vocabulary Е E . 

109 Figure construction . ш 


97 Memory for designs 
80 Orientation 


Group Test 70/23 non-verbal 


rogressive Matrices, 1942 
— 


The Clerical or Instructions tests certainly have a contribution to 
make, and are probably superior to the conventional form of 
clerical test (Group Test 25). Reading comprehension, too, is use- 
ful, and possibly the mathematics examination set by the training 
establishment itself. Other verbal intelligence tests are of more 
doubtful value, and non-verbal ones appear quite useless. Туб 
Spatial tests add very little, but a verbal orientation test (discussed 
in the next Chapter) seems to be one of the best. Unfortunately, 

^ it was given to only 155 cases, hence its results are less certain. 


CHAPTER XIV 
NON-VERBAL AND MECHANICAL TESTS 


intel- 
Abstract.—The Progressive Matrices and other he of 
ligence (g) tests were widely used in the deris Mises on tht 
spatial judgment (k-factor) were tried, Though thes engineers 
whole less valuable among adults than adolescents, sp. ей 
ing apprentices, there is some evidence that they us m for арр 
"practical? ability. Performance tests were deve ope sal vast] 
cation to African and Indian recruits and other рев Е 
Orientation and observation tests are described. Among t! E al conte 
mechanical tests, group paper-and-pencil tests of pues Pam, 
prehension, mechanical and electrical information and tra ably 68 
ledge, were generally more predictive than practical asse сс 
trade tests. Illustrative investigations show some of mor aid 
obtained among naval radar operators, boy tradesmen, radi 
electrical mechanics, ; 
——— 7 

The Progressive Matrice 
primary general intelligen a 
A.T.S. in 1941, largely in order to ward off criticisms of the educa 
tional bias in verbal tests i 
Fig. 5. This test has 


est. 
rices is an almost pure g pe k 
-Spatial or k factor to a small extent. 


Е SA: ons 
аз somewhat disappointing. Three г in 
may be suggested, First, the success ОЁ such tests as instruc 

and Arithmetic showed that the Se 


. I > 
n pointed out in Chapter XI. d 
Whenever a battery of verbal and mechanical-spatial tests 
234 
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E Yee 
еб үз. 
Ose 


DOLD 
ODOR) 


Fic. 5. SPECIMEN ĪTEM As IN THE Procressive Matrices Test. WHICH OF 


THE NUMBERED Pisces 1-8, IF FITTED INTO THE Empty SPACE ABOVE, Моор 
CoMPLETE THE PATTERN? 


applied, their combined result yielded a considerably better mea- 
sure than did Matrices alone. Only when its visuo-spatial com- 
ponent was involved did it seem to have anything to add to such a 
battery. Thus its best correlations were with proficiency in visual 
Signalling and radar operating and among А.Т.5. recruits engaged 
9n various mechanical and anti-aircraft duties. The third reason is 
Its rather poor reliability, and its susceptibility to non-intellectual 
influences. More than any other group test it is affected by age and 
other types of mental deterioration, possibly also by temporary 
emotional stress. 'The improvement in scores after attendance at 
Physical development courses was described above (p. 201). 
Dr. J. A. Fraser Roberts was able to show, by a detailed compari- 
Son with more reliable tests, that the unreliability is greatest in the 
15-30 score range, i.e. just about the level where acceptance or 
rejection for the Services takes place. Its efficiency is greater the 


i 
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ee a 
higher the score (in spite of the negative skewness 5i шры 
tion), and also at very low score levels. Doubtless, t л а clinical 
valuable when given without time limit by an "up ARCU 
psychologist who can interpret the significance pi cent 
particular test items. No evidence could be obtained, 10V ге 
men with scores which Raven calls "unreliable", i.e. i WC 
patterns of scores on the five sections of the test, Н 
neurotic or іп any other way different from men with * abe 
scores (cf. Eysenck, 1944). The test is seldom given nov a 
Services, mainly because its wide use has caused it to è 
rather well known. 2 

Another matrices test, where the testee draws in his e 
instead of choosing one of a number of patterns, was Meme 
Stephenson for the g section of his R.A.F. battery. Thoug le. The 
times difficult to Score, this version seems to be more PET Ned 
N.LI.P. Group Test 70, which was occasionally used in the и 
and Army, has three parts of which the last two resemble Р E M 
sive Matrices. Part Lis a very simple but surprisingly effective id 
Five two-line drawings are Shown, with a long series of шерам 
bits of these drawings. The testee has to identify the Eat 
drawing to which each bit belongs. In the absence of de ae 
evidence, it is Possible that this test involves g for much the oe 

i ctions test, namely, rapid learning of the ane 
ntal flexibility, and continuous concentra “tal 
Parts show some dependence on visuo-spa' 
ability. An alternati: ; 


à small extent among dull and backward recruits. 

h Spatial Judgment Tests А ane 

The spatial test most widel used in the Navy, Army and A.T-9 
was the N.LT А у in the avy, у 


$0 formed would, if turned around; 
make a square, The N.LIP. 
1940) was occasio 


— 
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Form Relations test was seldom employed because of its heavy paper 
consumption the aecompanying Memory for Designs (S.P.97) was 
often given in the Navy. It has the drawback of being very difficult 
to score. A test based on the copying of straight-line figures on to 
a space marked out with dots has been used by Stephenson, 
McQuarrie, Slater (“Figure Construction Test") and others (cf. 


= a 
Fic. 6. Squares Test. IN THE LEFT-HAND FIGURE, A LINE Drawn From А 
то B Wourp Divine тне Ficure INTO Two Preces WHuicH COULD BE PUT 
TOGETHER TO MAKE A SQUARE. IN THE RIGHT-HAND FIGURE A SINGLE LINE 


CAN BE Drawn Between Two Points, DIVIDING IT INTO Two Preces WHICH 
WourLp МАКЕ A SQUARE. 


Fig. 7). An extension of this was to have some of the figures 
reversed mirror-wise or turned through 90 degrees before being 
copied. Since the testees’ responses are creative instead of selective, 
there is a slight subjective element in scoring their correctness. 
Still another type of spatial test is the paper formboard (cf. 
Paterson and Elliot, 1930). The selective-response versions of this 
test, sometimes used in America, appear rather artificial. But a 
creative-response version tried out in the Navy had to be dis- 
carded because of the subjectivity of scoring, in spite of the great 


Fic. 7. FIGURE CONSTRUCTION Test. Copy EacH FIGURE ON TO THE DOTTED. 
Space то THE RIGHT or rr. EacH Ілме Musr START AND END UP AT A 
Dor. BEGIN AT THE Dor Wits A CIRCLE ROUND IT. 
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dvantage that this test can be “got across” to dull E T 
P adik n any of the others discussed here (cf. Fig. 8). À E a 
Sus e contains paper formboard, figure constructio: 

irror- ion items. : 
"The с s of these tests, and the с e 
they measure the same abilities or factors as p em 

Matrices, or performance and mechanical tests, are а ae i 
they were originally designed as intelligence p = (198), El 
Army Beta), many investigations such as Ke ey 


D 
BE FITTE 
Fic. 8. IN THE LEFT-HAND FIGURE, THE Two BLANK Preces CoULD 
TOGETHER AND TURNE 


Koussy’s (1935) and Thurstone’s (1938) have shown that ke 
embody a space factor—named «р» or 'S"— distinct from 

Whose essence (according to El Ko 
anipulation of s 


igations in 
patial relations. Investigations 
the Services similarly showed th 


nd ween. 
at there is little to choose betv 


Dt in mechanical tests. ideo 
- Р. 207) of the vocational valu 
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mechanics in the Forces. Nor did they show any marked correla- 
tion with radar operating or other jobs involving prominent visual 
components. Nevertheless, the occasional successes of Squares were 
particularly interesting. Not only did it show up better when com- 
pared with operational, as distinct_from training, criteria among 
Royal Marine signallers and Coastal Forces ratings, but also higher 
rates in the regular (continuous service) Navy did unusually well, 
and advancement correlated with educational ability chiefly among 
conscripts (hostilities only recruits). In the A.T.S., Squares was 
outstanding among draughtswomen and certain anti-aircraft per- 
sonnel, There was fair agreement with practical efficiency tests . 
in infantry. Finally, Army recruits picked out by psychiatrists as 
lacking in combatancy or questionable in emotional stability did 
particularly badly in the test. This evidence suggests that there is, 
indeed, a rather vague and ill-defined factor of practical ability, 
distinct from g and from mechanical ability, which is measured— 
unfortunately not at all efficiently—by visuo-spatial tests. Shortage 
of psychologists and restrictions on paper prevented fuller research 
ich might have led to the development of better tests of this 
ype. 
Performance Tests 

The Cube Construction test was sometimes applied by Admiralty 
Psychologists to doubtful candidates for mechanic branches, and 
Several performance tests were tried out in W.O.S.B.s (cf. p. 56). 
But all these were regarded more as qualitative than as quantitative 
tests, since they threw light on the candidates’ methods of tackling 
Problems, and no systematic results were collected on large 
numbers. The performance test most widely used in the Navy and 
Army was an adaptation by Trist and Misselbrook of Kohs Block 
Design, in several ways superior to the Kohs (1923), Alexander 
(1935) and Drever-Collins (1936) versions. A new series of designs 
was prepared, and scoring was based on the numbers of blocks 
Correctly placed within quite short time limits. This seldom takes 
more than twelve minutes to give and is highly discriminative from 
en 9-10 year up to superior (though not very superior) adult 

evel. 

Although Spearman and his followers (El Koussy, 1935; Cattell, 
1943) regarded performance tests merely as rather inefficient 
measures of g, containing no separate practical factor, Alexander’s 
and other experiments cited above show that this is not true of 
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adolescents and adults. The only analysis carried out in п Mie D 
indicated that Kohs-Misselbrook Blocks measures much the s 
ilities as Squares, that is + Аог Е. 4 
ана Sea а were devised for both Аеро 
Indian personnel. Though adequate validatory evidence was = 
cult to obtain, it appeared that the most successful tests MOM 
of the conventional formboard jigsaw or picture comp NU 
types (even though the pictures were of objects presumed "E 
to the recruits) but adaptations of tests which worked well in 
country, namely: 
(a) (oru of patterns or series made of strips of coloured 
wood (analogous to the Abstraction test). [осе 
(ф) Reproduction of figures by sticking pegs into holes (clo: 
parallel to Figure Construction), 
(c) Simplified Kohs Blocks. 


Orientation Tests 


A very different type of Spatial test contains verbal problems 


Such as: 


If when Standing on your head you are facing East, in what 
direction is your right arm pointing—N., S., E. or W.? th 
Very promising results were obtained with these tests bo 

among Army trainees assessed for orientation or sense of ше 
and in Predicting the navigation marks of officer cadets in the 5 UN 
avy. They were never developed further partly because the incl? 
sion of sufficient items for good reliability would make them very 
long, and Partly because it is difficult to prevent testees NE 
Merrill scale) they are tested in h 
one such test indicated a hig 


d 
mponent which only overlap Ра 
-tests, i 


Observation Tests j d 
, Three tests borrowed from the U.S, Army Air Force were us® 
in the selection of air crew, Obs-A shows eight large aerial pod 
graphs, on which Various objects or small areas of the terrain af 
lettered. Below each are six smal 
of the large one, and 


i the testee h; 
Priate lettered areas, In Obs- 
Matched with @ppropriate secti 


х out 
l photographs, as it were cut 


NON-VERBAL AND MECHANICAL TESTS 241 


are sub-divided into a dozen lettered districts. Obs-C is an aircraft 
identification test. Each item shows a silhouette of an aeroplane in 
the left-hand margin and five other similar silhouettes with the air- 
craft pointing in various directions. One out of the five has to be 


© 


(1) Each of the jars has four 
holes down the rignt hand 
side, Which picture shows Q2) Ball X hits ball Y at che 
how the water will run out? point P. Which ball will Y 
(^, B, C, or None). hic first ? (A, B, C, D or E). 


Fic. 9. MECHANICAL COMPREHENSION TEST ITEMS. 


Chosen which is identical (apart from direction) with the initial 
Silhouette. M 
These tests provided a good illustration of the dangers of judging 
from face validity. A and B in particular look as though they would 
be valid. But, though they did correlate significantly with initial 
and basic training marks among navigators, the coefficients were 
disappointingly small. In a factorial analysis all three tests were 
found to embody g + a group factor, distinct from k, and this 
- might, perhaps, be identified with the P or perceptual ability factor 
claimed by some American psychologists (Dvorak, 1947). More 
probably, however, this component is a result of training, some 
applicants for air crew having had more experience than others in 
reading maps and photographs and in aircraft identification. 


Mechanical Tests 

The tests chiefly used in the Forces were: , 

S.P.2, Bennett Mechanical Comprehension and revisions thereof. 
Some forty to fifty pictures are shown of mechanisms (e.g. gear 
wheels), or mechanical situations in everyday life (e.g. trains going 
round bends, men lifting weights, etc.), and selective-response 
questions are asked, as in Fig. 9. z 

A new Practical Problems test for the A.T.S. is based on pictures 
and questions about cooking, clothes, motor-cars and other things 
of which women might be expected to have had experience. 
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Mec-B, Mechanical Comprehension (Diagrams).—Eight diagr ams 
of machines are shown, and several selective-response questions 
are asked about the workings of each. This test has gas 
yielded better validities in the R.A.F. among pilots, navigators an 
flight engineers than has the pictorial Bennett test, Мес-А. 

5.Р.117Е. and M., Electrical and Mechanical Information.— 
Effective ten-minute tests were constructed containing twenty-five 
electrical and thirty mechanical items of the following, simple- 

completion, variety: 


The terminals in a metal electric lamp holder are usually 
mounted їп.. 


Sponse tests of mechanical information (Mec- 
C) and aviation + engineering information (Gen-D). The latter 
obtained Some of the highest Coefficients of any test among pilots 
and air gunners, and was also the best single test in the U.S.A.A.F. 

Correlations between comprehension and information tests are 
so high (approximately -7 in unselected groups) that they are 
clearly measuring much the same ability. Thus items of both types 


ze ions 
W naval recruiting centre tests, Question 


d group (cf. pp. 31-2, 45). ining 
| “A test of the Stenquist type, containi 
nine sets of parts to be assembled, was devine for the Army. Only 
Parts which could readily be duplicated from Army stores were 
ded. It was so arranged that one tester ‘could test eight 
multaneously In about thirty-five minutes. 
3 sr" required an easier mechanical test and one was 
coed which Involved Stripping and reassembling seven Meccan? 
me : rs the aid of full-sized Pictures (S.P.24), In the Navy» 
est of ability to bend a Piece of wire with pliers into a given shape 
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was used in several investigations (S.P.103), but it was not applied 
regularly because the scoring against a “quality scale” was some- 
what difficult. These tests, too, could be given to small groups. 
Cox’s or Vincent’s models tests of mechanical comprehension were 
applied in certain naval and Army investigations, and in the R.A.F. 

All the above mechanical tests and some of the spatial ones were 
Shown to be of value in the Forces in the selection of apprentice 
tradesmen at 14—15 years, and to add considerably to predictions 
based on academic examinations and intelligence tests, thus con- 
firming the results of Hunt and Smith (1945) and Holliday (1943). 
We have already described their disappointing validity among 
male adults and attributed it, partly to poor reliability, and partly 
to the distorting effects of the varied mechanical experience 
possessed by recruits from non-mechanical civilian occupations as 
well as by tradesmen. Models tests appeared to be even less . 
Successful than paper-and-pencil or assembly ones. Since the 
Specifically mechanical content of the latter was of little use, they 
acted as tests of all-round practical ability in much the same way as 
—or better than—spatial tests. Thus the assembly test obtained its 
highest validity coefficients (averaging -36 and ranging up to +58) 
when compared with tests of elementary training among infantry, 
assault course marks in R.A.C., and proficiency in demolitions, 
field work, bridging, etc., in the R.E. 

Given more paper and materials, and more staff, better tests 
might perhaps have been developed. An important principle estab- 
lished in later investigations was to choose test items which contri- 
buted most to mechanic selection over and above g and education, 
In order to cover different ground from that predicted, say, by 
Arithmetic and Instructions. But mechanical ability is itself so com- 
plex that the search for a test which might be useful for professional 
engineers, for radio technicians, for fitters, for garage, hands and 
for machine operators, does not seem very hopeful. Thus Admiralty 
and War Office psychologists came to rely more and more on 
Straightforward information tests. Tests like S.P.117E. and М. 
Were often found to be more predictive than either Bennett or 
Arithmetic. Tests of trade knowledge were particularly useful in 
auging the extensiveness of experience claimed by men who had 
already entered a civilian trade, and its relevance to Service jobs. 
While it is true that these are paper-and-pencil tests, they are much 
More reliable than the conventional practical trade test, and it was 
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А of 
sometimes possible to show that they give better ae 
success in passing mechanic or electrician training е7 iis v 
tions were sometimes raised that mechanics selecte in en 
might be lacking in the “practical knack,” but it is ex шс, 
dubious how far such an ability really exists. The evidence а: d 
by psychologists appears to show, rather, that men "d Em 
widely different degrees of practical ability in dier мен. 
the best basis for prediction consists of records of wor nani 
done. This problem of the nature, and consistency, of mec 
ability will be discussed more fully elsewhere. 


1 Illustrative Experiments 


\ 
‚ the 
The following validatory experiments illustrate some of 
points made above, 


' : veen 
Radar Operators. In 1943 correlations were obtained betv 
the standard naval tests and 


Success at the training course RAE 
603 radar operators, They are shown in the first column of i in 
XXXVII, Corrected for selectivity. This is the only naval stu A m 
Which the coefficient for Matrices surpassed that for T2, tongi © 
equalled it among visual signallers, It was shown, too, that Маїг 


RS 
TABLE XXXVIT- Typ VALUE oF Tests IN SELECTING RADAR Ona 
E ions in: 
Test Correlations 1945 
ees 
0 Matrices , ; z - Р sz 42 mS 
l Abstraction > Д < р 3 “34 es 
Bennett ; Ў Ы Е "28 122 
За Arithmetic, s A à ч Š "25 28 
3b Mathematics A E , i У "39 Bp 
4 Squares D 5 б . H *20 М 
re се a күтүш Sy Т. `39 ia 
Teup Test айталы" з з: d 
119 Scale and graph reading , б ‘ Ey 
:emory for desi. Е E З 22 
109 Figure Construction . ў 3 2n 
118 Oscillosco 


Ре reading , б * е 21 4 
f wh 


> Whereas Mathemat; 
end. Fai ing the 


dar sets and the nature of the training had gr' eatly 
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altered, and in a fresh experiment the standard battery along with 
several new tests was given to 411 radar plot ratings. The (uncor- 
rected) correlations with practical marks are shown in the second 
column of Table XX XVII. Although Mathematics is now relatively 
more important, and bility to read scales and graphs gives much 
the best predictions, Matrices still does quite well. Group Test 70 
is similar. Two additional k-tests are better than Squares, but do 
not add appreciably, and a U.S. Navy oscilloscope reading test, 
based on discrimination of pictures of radar patterns, is of little 
value. 

Army Apprentice Tradesmen.—Approximately 1,000 boys who 
took examinations and certain batteries of S.P. tests on entry at 
14 years were followed up over periods of one to three years, during 
which they had been trained as fitters (general, motor vehicle, or 
gun), armourers, electricians, instrument mechanics, carpenters, 
masons, or in other smaller trades. The courses were sufficiently 
similar, and the correlations in different trades sufficiently uniform 
to justify the quotation, in Table XXXVIII, of average figures. 


ТАЕ XXXVIIL—THE VALUE or Tests AND EXAMINATIONS IN SELECTING 
р APPRENTICE TRADESMEN 


Examination or Test — | Correlation Mechanical Test Correlation 
Arithmetic entry exam.. 15 2 Bennett . = » “29 
English entry exam. | . 08 4 Squares - è Р "29 
S.P.3A Arithmetic А 13 8 Assembly е 5 27 
S.P.3A Mathematics . *20 (Meccano Assembly . +22) 
S.P.17 or 25 Verbal . 16 (Mechanical interests . :37) 
(S.P.21 Instructions . *32) 

Matrices, 4 15 
Oral directions . j 13 


While all coefficients are small, this was due largely to the unreli- 
ability of. the available criteria. When divided into more and less 
reliable sets, the multiple correlations with the criteria were +48 
and -30 respectively. The tests listed in parentheses were applied 
to some 300 boys only, for whom the criteria were exceptionally 
good, hence their coefficients are not comparable with the rest. 
Bennett, Squares and Assembly are the most generally useful tests, 
along with Mechanical Interests—a test on the same lines as 
Strong’s Vocational Interest Blank. Arithmetic is of some value, 
but general intelligence, Directions, and verbal-educational tests 
(with the possible exception of Instructions) have little to con- 
tribute. The correlations for examinations are unduly low since 
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these examinations were actually used for selection, but correction 
ises them only slightly. 
j^ А similar silly with 550 naval artificer apprentices showed that 
the following tests have the best validities: 
S.P.2 Bennett, 117E. and M. information. 
Medium validi coefficients were obtained by: a 
S.P.97 М; for Designs, 4 Squares, 3b Mathematics, 
7 Kohs Blocks, 110 Cox models and 103 Wirebending. 
Very low validities were obtained by: МЕ? 
S.P.1 Abstraction, 3a Arithmetic and the entry examination. 
Radio and Electrical Mechanics and Wiremen (L).—Several infor ш 
Mation tests were given to some 300 naval radio mechanic чаша 
214 electrical mechanics and about 100 wiremen (L)—that i 
civilian electricians who had passed a practical trade test on к, 
into the Navy. Correlations with training results are shown 
Table XXXIX. Test 84 is a test of trade electrical knowledge; 


99 a test of knowledge of radio Symbols, and 117E. of everyday оГ 
amateur electrica] information, 


D 
TABLE XXXIX. Tu VALUE ОЕ Tests IN SELECTING ELECTRICAL АМ 


DIO TRAINEES 


Correlation among: 


Test Radio Electrical | Wiremen 


Mechanics | Mechanics n 
DT | ане) 


41 
ars "29 51 . 
2 Bennett : t . 2 0 
3b Mathematics z © И 3 ee 4 D 
84 Electrical trade knowledge К i +33 "33 ae 
99 Radio symbols E у $ 5 *35 "29 узы 
1E. Веста! information ©): | 35 46 p 
€chanical i; i T . * К 
Initial trade КЫ, é кан : Ў = E 208 
.S.0.s? Judgments of Suitability ` : 33 29 ks 


АП three information tests 
radio Mechanics, With electr 


еге found in othe 


(direct entry) and cinema proj 


CHAPTER XV 
SPECIAL APTITUDE, AND TEMPERAMENT, TESTS 


Abstract.—Descriptions are given of tests of physical agility, and 
of auditory capacities including aptitude for morse; also of tests 
involving apparatus for gunners, radar operators, aircraft pilots 
and other special groups. The results obtained, particularly with 
the simpler tests, were mostly disappointing, and it appeared that 
more successful selection could be achieved with the aid of paper- 
and-pencil tests and *work-sample" methods. Some progress was 
made with the selection of recruits suitable as instructors, but very 
little in picking those with good or poor personality and tempera- 
mental qualities, such as leaders on the one hand, and emotionally 
unstable or psychiatric suspects on the other hand. Recent develop- 
ments in personality testing are reviewed. Projection tests and 
other indirect methods are valuable in skilled hands, and batteries 
of objective tests for certain “dimensions” of personality such as 
general neuroticism are feasible, though too elaborate for large- 
Scale use. Neurotic inventories or questionnaires gave good results 
in the Forces, but would not necessarily work so well under civilian 
conditions. However, these, together with group projection tests, 
and interest questionnaires, deserve further investigation. 


A number of specialised tests were tried out in the Forces and 
а few were used regularly, though none was outstandingly success- 
ful. Some of the results obtained with Chleusebairgue's and other 
motor driver tests, with auditory tests for asdic operators, and with 
the physical agility test in the Army, have already been quoted. 
The latter (S.P.16) was measured by the time taken to transfer a set 
of Steel rings from two upright posts to two others standard 
distances apart, the mean time being approximately one minute. 
The test was devised to resemble the duties of gun numbers and 
infantry, and did yield significant but small correlations with pro- 
ficiency among the latter (averaging 22). But it was very unreliable 
being much affected by the floor surface and the recruits’ shoes, by 
their state of health at the time, and by the amount of enthusiasm 
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end 
tester was able.to stimulate. Even this test was Кы, ко 
ee extent on g. It had a certain propaganda val = Бао 
ШЫГ else. Several dexterity tests were tried out cia ‘A figi 
branches of the Navy, Army and A.T.S., with no po o Lu 
dexterity test (Co-ord.-C) uim is being used in 
described by Cockett (1947). m- 
Sr н аал was devoted to tests for MEAS, E 
it was hampered by the extreme unreliability о аа. reall 
criteria. Instructors and officers could not say which re 


gars : tests were 
` Were good or poor gunners, and objective proficiency 


АГ ituation 
tuled out by lack of time and ammunition. Two Ee. 
tests—Hotoph’s layers test and Craik's mo e La aintain the 
ing results, but it was impracticable to construct an cele Aen 
apparatus needed for testing, say, a 1,000 men a we "ented 
A.T.S. recruits in anti-aircraft batteries, a set of we vi 
motor tests gave correlations ranging from +-165 to — 


trices 
cy at a practice camp, whereas eed. 
th the same criterion, and Table dicc. 
pencil tests yielded even better pre 


men 
gunnery school as was T2 (correlations of 47). Of forty-three 


ne 
receiving recommendations at their seaman establishment E a 
(2-3 per cent.) failed, whereas of 162 not recommended, mple 
cent. failed, This Suggests that Paper-and-pencil -+ work-sa 
testing constitutes а 


è nners 
better approach. to the selection of gunne: 
t aptitude testing. 


Chapter XVI Shows that the 
of aircraft pilots, 


ТИЙСЕ; З р tro: 
€ can adjust its Vertical position by a con 


sition by a rudder bar, so that Lu 
and feet are tested. The runs d 
scored by the amount of time the light falls outside a central papan 3 
~ord.-B) consists of a ro d 
winding road with “kerbs” an 
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“obstacles” punched in it. The testee steers a bronze ball along 
this road by a hand steering wheel. Each time he fails to avoid an 
obstacle an electrical counter is operated. The wheel is so geared 
to the ball that motor anticipation is essential, since rapid move- 
ments at the last moment cause it to wobble violently. The U.S. 
Army Air Force, with its much greater resources of personnel and 
materials, was able to develop several other apparatus tests possess- 
Ing useful validity for pilot selection. Nevertheless, its paper-and- 
pencil tests generally gave better predictions (cf. Flanagan, 1946; 
Davis, 1947). Similar work was done by Biesheuvel for the South 
African Air Force. 

In yet another experiment on naval radar operators, two tests 
Were found to add appreciably to T2 in the prediction of training 
marks. The Echo test, devised by Craik, provided a mechanical 
representation of the “scan” of a commonly used radar set and 
Was scored by means of the smallness of the echo, or peak in the 
tracing, which the testee could place correctly. This had to be 
abandoned, however, since no method could be found for convert- 
Ing it into a group test. The other test, Pointers, required testees 
to gauge the positions on a scale to which two arrows were point- 
ing. Forty such items were shown on large cards at an increasing 
rate, the object being to observe the testees’ reactions to stress, and 
their tendency to breakdown or “flap.” Whether this really acted 
as a test of temperament could not be determined, but it certainly 
depended to a considerable extent on mathematical ability. Hence 
the scale and graph reading test, mentioned above, was developed 
Instead, 

Morse Aptitude and Auditory Tests 

The Morse Aptitude test (S.P.10) used for selecting Army and . 
ATS. signallers during most of the war consists of seventy- 
eight pairs of sound patterns, presented by gramophone and head- 
Phones. Testees judge whether the two patterns in each pair are 
the same or different, This is an American test—the U.S. Signal 
Corps Code Aptitude test—dating from 1918. Since its reliability 
18 inadequate, American psychologists sometimes trebled its 
length, but later discarded it in favour of morse learning tests, The 
US, Army “Speed of Response" test has now been adopted by 
the Royal Navy, and was proved in an Army investigation to 
Possess superior validity. It, too, is given by gramophone records. 
The testees first get instruction and practice on three actual morse 
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patterns for seventeen minutes, and are then tested for the аш 
ness and accuracy with which they can receive these patterns, a 
about ten minutes. Naturally recruits with previous eee 
have an advantage, hence it is supplemented with a short recor 
testing receiving ability at four and six words a minute. "m. 
The old Morse Aptitude test occasionally gave correlations о dd 
to :35 with receiving proficiency after training, but in ws A 
follow-up its validity was lower and could be attributed ш Y 
its dependence on intelligence. Correlations of Arithmetic, Cler a 
and Spelling tests with proficiency were usually superior. It y 
particularly defective in that its correlation was better at gi Р 
than at the bottom of the scale, i.e. it was fairly efficient in di j n 
entiating potentially Very good from average telegraphists, bu 


TOY А orse. 
almost useless at picking out recruits who could never learn m 
Even the new morse 


validity coefficient of -4 


twenty-fifth week (20 
Several trainees with v 


lasted forty-two weeks, and among Army 
hardly seems likely that any test Hed 
be expected to give good predictions 


3 m : Һе 
Ones, with decreasing intensity. TÉ 
opelessly inadequate, and in only E 
Tators—was it ever shown to have а 


E. f 
* The reliability : he methods 0 
testing proficiency, However, in R AT. by the inadequacy of the 


р jective 
AF, ions the American objective 
Code Receiving test; Were empl d hin gations t individual var! 
ability was still four " mployed, and highly significant indi 
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been adopted by several Education Authorities for identifying 
hard-of-hearing children, it is to be hoped that careful investiga- 
tion of its value will be made in schools before any reliance is 
placed on its results. Two gramophone tests developed in the 
R.A.F. are briefly described by Dickson (“Brit. Med. Вии”, 1947). 
One involves counting the numbers of pure-tone pips sounded at 
decreasing levels of intensity; the other requires recognition of ` 
words heard through a background of engine noise. 


Selection of Instructors 

Investigations were made in all the Services into the selection 
and training of instructors, those in the R.A.F. being the most 
extensive. Youthfulness, intelligence and education were found to 
be of considerable importance. Thus the Mathematics test 3b 
correlated +45 with assessments at the end of training among 161 
naval gunnery instructors; and among driving instructresses in the 
A.T.S., instructions, spelling, the “Mec” test and previous educa- 
tion yielded good results. But a particularly effective miniature 
Situation test was developed in the latter research. This consisted 
in giving the candidate a Meccano model to study, and then having 
her explain to a “stooge”? how to make the model, given only the 
Separate Meccano parts. Ratings by a psychologist who listened to 
the explanations of 146 candidates were considerably more pre- 
dictive than ratings, based on interview only, by a Р.5.0., a 
psychiatrist, and a mechanical transport officer. Further work con- 
firmed the value of this method and demonstrated that it could be 
applied equally successfully by specially trained P.S.O.s. But 
other. ; more objective, tests which were tried out as alternatives to 
this subjective rating showed no promise. 


Personality and Temperament Tests 

In Chapter IV we described the importance attached to per- 
Sonality qualities among naval and Army officer candidates, 
together with the interview and group task methods developed for 
assessing them. It would have been extremely useful to possess 
reliable tests which could be applied to all recruits, both for screen- 
Ing cases with suspected neurotic or psychopathic tendencies (who 
Could be sent for interview by psychiatrists), and for judging 
leadership, industriousness, and the like, which were clearly of the 
greatest significance in all naval and military employments. Even 
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full 
if the. worth of the W.O.S.B. techniques had с. Ет a 
established, it would have been impracticable to e Med 
very large numbers. Hence the assessment of e e T уе 
almost entirely on P.S.O.s? interviews, whose inadeq dem 
already admitted. The difficulties of personality anim din Wm 
been described, and these are enhanced when н ney nd 
that the tests must be group ones, or—if individual—v ху bari 
and when a considerable proportion of the testees а: eon 
moderately skilled. Little ш NC. 
y qualified psychologists; thus, 

very far. Р 

a ies work was carried oe 
borators at the Mill Hill Emer n) 
his main methods and results wi 2 
f factorial analysis, experiments x 
Army and A.T.S. patients, he Me. ^ 
of two major dimensions or facto 


s Jd of 
personality, comparable to & and v sed v. k:m in the fie 


abilities. These are: 
(1) General neurotici 
and Poorly-organised Personality. hich are 
(2) Extravert о, introvert tendency, the extremes of w obses- 
represented by hysteria and dysthymia (anxiety or 
sional Neurosis) respectively, 


t 
; : exten 
A number of tests were found to differentiate to some 
between hysterics 


and dysthymics, or between maladjusted ар 
normals. The latter included: ions as: 
(i) A Personality questionnaire containing such questi 00 
Have you ever been off work through sickness а £ 
deal? үе; No 


h N о. 
tto make friends? Yes. 


justed 
Sm, or stable and integrated v. maladj 


The 
Surveyed by Vernon (1938a) support SR 500 
mean scores on forty questions among 300 normals and оу 
Deurotics were 9.6 and 19 
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(ii) Body Sway. (a) Static Ataxia. The testee stands upright 
with his eyes closed, and a thread is attached to the back 
of his collar, and connects with a pointer which records the 
amount of sway on a smoked drum. Among 120 normals, 
none swayed more than 2 inches in 30 seconds, whereas 
343 per cent. of 900 neurotic swayed 2 to 6 inches or more. 

(b) Suggestibility. A gramophone record is now started 
which reiterates the suggestion: “You are falling, falling 
forward. . . ." Sway is generally increased and some 
60-70 per cent. of neurotic patients now move more than 

.... 2 inches, but only 5-10 per cent. of normals do so. 

(iii) Dark Vision. The Livingston rotating hexagon test, used 
in the R.A.F., showed that neurotics tend to have much 

.. poorer dark adaptation than normal recruits. 

(iv) Rorschach Inkblots (Harrower's multiple-choice group 
test). The ten inkblots are presented by slides, each with 
nine possible responses, some commonly chosen by neu- 
rotics, some by normals. The testees rank these in order of 
appropriateness, and the sum of their rankings of the 
neurotic responses constitute their scores. 

(у) Word Association (adapted from Malamud, 1946). Two 
possible associations are supplied for each of fifty words, 
one characteristic of normals, one of neurotics. The score 

,. 1s the number of neurotic responses preferred. 

(vi) Perseveration Tests. It was confirmed that normals tend 

to obtain moderate scores and neurotics very high or very 
я low scores on ordinary motor perseveration tests. 

(vii) Persistence. In one simple test (similar to that of Fernald, 
1912), the testee sits on a chair and keeps the heel of one 
shoe about an inch above the seat of a second chair as long 
as possible. The average times for hysterics and dysthymics 
were fourteen and thirty-one seconds, and for normals 

|... Over one minute. 

(viii) Personal tempo or normal speed of work on such tasks as 
manual dexterity or co-ordination tests was found to be 
slower among neurotics. Good results were obtained with 
the O’Connor Tweezer Dexterity test and the Track 
Tracer, where the subject traces a path with a metal 
stylus between rows of holes on an ivorine sheet, and a 
buzzer sounds each time a hole is touched. 
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Eysenck has advanced some way towards the production 
reliable batteries of tests for measuring neuroticism and e B. 
dysthymia, but not all the tests mentioned would be ves еа 
large-scale application, and evidence regarding the validity of s 
of them is as yet hardly conclusive. 


“Indirect Tests 


will always 
favourable | 
writer have 
the subject 
in which h 
was the pri: 
African A 
carefully trai; 
ally prepared 
methods of 
judgments h: 


or 
Moreover, several aspects of performance at sensory-motor 


i d ere 
which could be objectively measured, E ar 
mperamentally significant. When, however, sim. 
methods were 


tried out in the selection of U.S.A.AF. pilots, ү, 
results were generally disappointing (cf. Davis, 1947; Guilfor 
1948), 


Another approach to as 
adopted by the Cambridge Unit of Applied Psychology. It W 


considered that temperamental stability or tendency to “Вар” ж 
be shown by the trend of performance in tests which impose рг st 
longed stress оп the subject. One of these was the Pointers t€ К 
(р. 249), another the Track Tracer, applied at maximum speed £0 


1- 
third Was a modification of the McDougal 


аге recorded, and from the up E. 
downward trend or the irregularity, a somewhat subjective айу 
ed. Adequate evidence as to the уа rd 
not yet published, and they would ha 


: 3 was. 
the investigation of, temperament 
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appear suitable for large-scale use, both because they involve a 
fairly lengthy session of individual testing, and because they could 
scarcely be entrusted to semi-trained testers. For the same reasons 
no attempt was made in the Forces to apply projection tests except 
to relatively small groups such as officer candidates, although the 
original Rorschach inkblots, thematic apperception, and other pro- 
Jection tests have come to be regarded, in recent years, as the most 
promising techniques for the diagnosis of personality. It was noted 
too, that tests of the projection type tend to be unpopular both 
among testees and employers (e.g. Army officers) because of their 
apparent lack of relevance to the job. The psychologist cannot 
afford to disregard this factor of “face validity” completely. 
Another test providing measurable indications of such traits as 
calmness, foresight, initiative and persistence is Cattell’s (1941, 
1944) C.M.S. or cursive miniature situation test, which is an ela- 
boration of the dotting test. He gives striking evidence of differ- 
entiation’ between delinquents or other unstable personalities and 
Normals. Though it failed to live up to its promise in a small trial 
With W.O.S.B. candidates, this may have been due to our inability 
to construct suitable apparatus and dotting sheets. It suffers, 
OWever, from a serious drawback, namely, that the scoring 18 
extremely tedious and time-consuming. Only if this could be 
Simplified or automatic electric scoring introduced, would it 
come a practicable test. As mentioned above, it is possible that 
the Clerical or Instructions test is another indirect measure of 
Certain temperamental qualities, comparable to Biesheuvel’s, the 
Cambridge, and Cattell’s tests. Р 
D 4 
Tests Tried Out in the Army 
As already indicated, no precise data are available on the value 
9f the Projection tests used at W.O.S.B.s. An early experiment, 
Owever, revealed significant differences between the grammatical 
forms of response to the group word association test among fifty 
Company commanders, fifty W.O.S.B. passes and fifty fails (of 
equivalent intelligence level). For example, the mean percentages 
of complete sentence responses were 41, 31 and 19 per cent. 
respectively. No simple method of scoring either the form or con- 
ae of responses could be devised, which could be used by un- 
Skilled testers, except the number of blanks or failures to respond. 
1S was tried out at an Army selection centre on 218 recruits, who 
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= eramental 
were interviewed by psychiatrists and азы үн 
stability ©. neurotic tendencies on the Cu Ый 5 a to defer 
whatever was found, and failures to respond app (eg. the pé 
chiefly on illiteracy and on the testing props Pe 
ticular tester). By contrast a “medical EI tee Ind d 
Eysenck's but based on Part III of the Cornel br ачен 
(Weider, 1944), gave an agreement represented | is и, 
correlation оЁ-31 with the Culpin scale ratings, which i 

omising. Д Sina 
Pr The Jae Rorschach (ranking) test was given to ph. 
selection centre, fifty of whom had been picked А as iter 
personality problems by the P.S.O. and ada ee pep. 
mediate, and fifty as being the most stable and well a ds M 
group. The median scores were 215, 223 and 234 re es MS 
Corresponding to a correlation ratio of 26 (note that " SENT 
indicates instability). If a borderline mark is fixed at 200, t. of the 
cut off 25 per cent. of the most neurotic and 12 per T end should 
remainder. Such differentiation is far from adequate, m 1 neal 
be pointed out that the testees were a selected group. In g ene 
recruits are not sent to a selection centre unless they are pa id 
degree maladjusted, yet, at the same time, few are likely ER that 
serious neurotic state, Thus this result does not confie МБ ы: 
of Eysenck who obtained median scores of 205 and 23 E 
neurotics and normals. The test was found to depend ME 
extent on g and v:éd (giving correlations of :33 with ok pre- 
and Spelling), However, with the revision of the sa г be 
sented to the testees, it might in combination with other Hie on 
useful for Screening Purposes. A new inkblots test, suita 


t been 
group application, was Prepared by Harvey, but has not ye 
validated, 


Is 
; = 5 norma: 
It may seem curious that neurotics should differ from Б 
оп 5 


[се xterity: 
€nsory and motor tests such as dark vision and d) claims 
though not on tests of mental abilities, But Slater (1944) © 
that temp inferi 


2 ОСІ 
У would be expected to be assoc! 
With constituti ical i 


exceeded the normal median Score. 
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Some experiments were carried out on gradings of recruits by 
their own fellows, which appeared to show that they, were capable 
of nominating promising candidates for commissions, who had not 
been picked up by the usual channels. Unfortunately, this tech- 
Nique was received with some suspicion. Subsequent American 
work has shown that gradings by fellow cadets at officer training 
schools give quite good predictions of success in battle a year or 
two later (Baier, 1947). The correlations of -42 and :51 are dis- 
tinctly more promising than correlations obtained for W.O.S.B. 
or for O.C.T.U. grades. This suggests that it might be worth 
taking fellow employees’ opinions into consideration in managerial 
and other civilian selection. 


Personality Inventories and ‘Medical” Questionnaires 

Evidence regarding the validity of these questionnaires in 
Civilian practice is decidedly unfavourable (cf. Vernon, 19384; 
Ellis, 1946), and one would naturally expect them to be more 
unsuitable with low-grade, semi-illiterate recruits. Moreover, they 
tend to arouse suspicion among naval and military officers. Never- 
theless, they achieved surprisingly good results both in America 
and in this country, especially when given with medical backing. 
Conrad (1947) suggests that this was due partly to the greater . 
heterogeneity of recruits than of most civilian samples, and partly 
because the questions asked are very similar to those which. 
PSychiatrists ask when they are assessing the neurotic tendencies of 
the testees, Possibly, too, average and sub-average men and women 
accept the questions more readily than sophisticated university 
Students, on whom much of the earlier work was performed. Yet 
another factor is that civilians usually try to make themselves out 
aS normal as possible, whereas recruits may adopt the opposite 
attitude. 

Two main types of validatory criteria have been emi 

Neither of them very satisfactory: 

(i) Assessments of neurotic tendencies in a miscellaneous group 
by psychiatrists, 

(ii) Disce Deu between persons who have already developed 
neurotic breakdown and others who are presumed to be 
normal. Naturally, it does not follow that the neurotics 
would have given the same answers before they reached the 
stage of hospitalisation or discharge. 

P.8.—9 


ployed, 
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i alue of the 
Nevertheless, one American study cim her zi 
National Defence Research Council's personality 


i 1 геге discharged - 
differentiating recruits who, in the following year, were 


/chiatric 
or committed offences, or were promoted. A E MEUM а Harve 
follow-up of the Bennett-Slater ur параи pw 
designs (inkblot) test has been undertaken in t E "e NUS 

While adaptations of the Cornell index an rts in NB 
inventory (short format) were “got across” to ep аз 
country quite effectively, the most successful of suc he nauim 
published by Bennett and Slater (1945), where ма с: 
answers are ingeniously concealed. This is in ten sectio T вше 
which gave significant differences between neurotic mee: of 
patients at Sutton E.M.S. Hospital. The first three E 
ordinary questions, in about half of which a positive ans E 
cates neurotic tendencies, in the remainder a парвое с 
Thus the testee who wishes to misrepresent himself ee n 
answer “по” throughout. The sections deal with oe ate wae 
anxiety, hysteria and depression, respectively. The miM 
the least satisfactory. The Next four consist of lists of “а 


: z : These are 
ances," testees checking each item they find annoying. 
classified as follows: 


(1) Frustration of self- 
to do your job.” , in 
(2) Personal inadequacy, e.g, “You forget what you're looking 
for.” 
(3) Dirt or untidiness, €.g. “An unmade bed.” 
(4) Noise, e.g. “The sound of hammering.” А оу алу 
Nos. (1) and (3) are regarded as things which might Wes affect 
normal person, but Nos. (2) and (4) are much more likely t core 18 
neurotics. АЦ types are mixed in the test blank, and the s 
based on the difference between them, t test, 
he last three Sections are revisions of Pressey's cross-ou 
where the Subject crosses out items: ding." 
(1) Forwhich People should be blamed, e.g. “Flirting, SP ing” 
(2) Which t ey have worried about, e.g. “Loneliness, all 
(3) In which they are interested, e.g. “Football, comedia rotics, 
ets items are chosen as likely to affect nev 
and in the third as likely to appeal more to wee ighte 
Slater shows that when the Sections are арргорг iately ve an 
Score will cut off 6] per cent. of neurott 


how 
assertion, e.g. “Somebody tells you 
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only 2 per cent. of normals. This needs, of course, to be confirmed 
on another, larger, group. A defect of the test is that it takes nearly 
an hour to give. Moreover, the scoring is so lengthy that it could 
never be used as a routine test. However, some of the sections and 
some of the items are more diagnostic than others, and a more 
practicable revision might well be constructed. American experi- 
ence suggests that a short twenty to thirty item test is as effective 
as a more elaborate instrument*. 

Another questionnaire tried out successfully on motor drivers 
(cf. p. 230), contained fifty miscellaneous questions such as: 

Can you ride a bicycle? 

Do you think all cars should be restricted to 30 m.p.h. ? 

Do you always take a torch with you in the blackout ? 

Do noise or vibration in a bus or train ever give you a head- 
ache? 

5. Do you prefer indoor to outdoor games? 

All of these differentiated good from bad drivers, perhaps for a 
variety of reasons. No. 1 may indicate “machine sense.” “Yes” 
answers to Nos. 2 and 3 suggest undesirable nervousness, while 

os. 4 and 5 are of the conventional psychosomatic or neurotic 
type. Such questions are less embarrassing than those in most 
Personality inventories, and their object is not at all obvious—some 
of the “good” answers being “Yes,” some “No.” Though the 
Scoring is relatively easy it was the time taken over this and over 
giving the test which prevented it from being more widely used. 


me eo Ro Es 


Interest Blanks 

Strong's Vocational Interest Blank and Kuder's Preference 
ecord have been shown, in America, to give useful predictions of 
Suitability for a number of professional and commercial careers. 
Probably they are much less effective among average and low- 
grade adults, for at these levels the responses to interest items 
pend to so large an extent on intelligence and education, or else 
9n temporary fashions. For instance, the interests ticked by a 
Skilled instrument mechanic might resemble more closely those of 
?n equally intelligent clerk than they would those of a lower-gr: ade 
meta] Worker, say a welder. Nevertheless, a test consisting of a list 
of jobs and leisure-time interests was standardised in the Army on 
* CE. Stuit (1947). This book contains an excellent discussion of the con- 


Structi ОК ‹ у 
tion and uses of personality inventories. 
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2 ы ons— 
recruits representative of the various training cone ae 
drivers, mechanics, clerks, etc. Thus any new и de 
could be scored for their resemblance to those of me: “ОП. 
these types of employment. Though the scores had poo dba 
they showed useful predictive value in several experime - nd fe 
test was abandoned mainly because of the time s higher 
scoring. A test of this kind might have been useful wit Mm 
grade population such as officer candidates, bar P nd Arm 
studies with passed and failed candidates in the Navy a me fn 
did not reveal many consistent interest differences. The sa n co 
ing was reached in work with U.S.A.A.F. pilots, and it w "EL. 
cluded that aviation and other interests are better asses (1948) 
information tests (Guilford, 1948). Probably, as ew. 
Suggests, the type of interest is of less importance than the E 
ness with which it is pursued, and it is difficult for а Рае 
Paper test to elicit this attribute for a large number of inter | 


Conclusion 
Our conclusions г 


in peace-time have 
How about person: 


3 ille 
application are not very bright. When ample time and ski 


2 b í 
personality would appear to uld Þe 
etter interviewing techniques, which co 
ised among different interviewers. 


CHAPTER XVI 
MAIN R.A.F. SELECTION FINDINGS 


Abstract.—This Chapter gives detailed information on the follow- 
Ing topics: 
y im training wastage before the introduction of grading. 
x elation between speed to solo and training achievement. 
A Quarterly pilot training wastage from 1940 to 1943. 
‚ Relation between grading score levels and elementary flying 
pass rates. 
- Layout of the original grading score card. 
Ч Use of the raw grading scale by a number of different schools. 
. Relation of grading performance to subsequent accident rate 
Е in pilot training. 
ч Боп of grading to age differences and failure rate. 
- Composition of the original (April, 1944) air crew aptitude 
MERC battery: 
n SU of tests in the aptitude test battery. 
. alidation of aptitude test category batteries against different 
, 1 training stages. 
- Interpretation of aptitude test 
n ра поа and Service requirements. x^, 
M fee and character traits deemed importantin air crews 
iP in iability of the above trait assessments. 
- Average intelligence test scores for selected 
trades. 
16. Multiple correlations between G.V.K. scores and training 
results for a variety of R.A.F. and W.A.A.F. trades. 


Dn 


results in relation to expressed 


R.A.F. ground 


The purpose of this Chapter is to present the findings on R.A.F. 
be of general interest. So 


ee 
ae methods that are most likely to 

{н as possible the Tables will be left to speak for themselves, but 
ertain amount of verbal clarification will usually be necessary- 


Th Air Crew Selection 
C e chart (p. 263) shows what had happened aft 
volunteers who had been accepted for pilot training by 
261 К 
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Candidates Selection Boards. Out of every 100 cadets eps ^ ^ 
ing there were 41 who had qualified for. operations, marly 
qualified but were engaged on non-operational duties (р! eh Бай 
instructors), 6 who were still under training, and s Ыт cm 
failed to qualify as pilots. It is clear that wastage was hea NA 
centrated at the elementary flying (E.F.T.S.) stage which w. 

used as the main means of pilot selection. 


TABLE XL.—RELATION BETWEE 


CHIEVEMENT 
SPEED ТО SOLO AND TRAINING ACHIEVE 
Total cases: 


-F.T.S.s, 36 courses, N = 2,292 


Did not solo = 554 (24%). Did solo = 1,738 (76% 
Hois Sample Flying Ability 
Dual | Total кее ES S.F.T.S. 
Group to Cases © -on 
Sol follow- Med. Inf. 
0:0 ир Supr. | Med. Inf. | Supr. LL— 
— 0, 
Hrs. [A pA % % % % 
V. fast 5-8 193 150 52 45” 3 44 48 11 
Fas, . | 9-10| 610 | 150 | 34 58 Sal eb s EDD 
Medium | 11-14 | 785 | 150 20 66 14 29 54 32 
low . | 15-21 | 150 150 6 59 35 14 г 


se 
The selection method known аз Grading presumes "ia 
Correspondence between learning speed and subsequent ac e the 
ent and before it was introduced it was necessary to p aa T.S. 
reality of this association. Table XL which is based on the E. hows 
records of some 2,300 cadets during the summer of 1941 5 
two things: 


d 
e 
(i) That almost a quarter of the cadets have to be suspend 
Without going solo at all. roups 
(ii) That if those Who do go solo are divided into four E adt 
number of hours required to reach this 


ing, one Will, in fact, be dis 
Solo. 1 


i to 

] | " time е 

yields tetrachoric Correlations of 39 and ‘25 between you be 

solo and E.F.T'S, and S.F. Beck first figure Vigher і 
igher it il pils Dea S results respectively, The hig! 


d much 
they all passed on to S.F.T.s. ! fo Е.Е.Т.5, and the secon 
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PER CENT WASTAGE 
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TRAINING IN U.K. TRAINING OVERSEAS 


Ure GRADING PERFORMANCE гый 
855 E FALE EFTS: 


^ 


Fic. 12. FLIGHT Tesr (R.A.F.) E.F.T.S. 
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Up to and including the first quarter of 1942 flying training 
schools were dealing with cadets who had been nominated as pilot 
trainees by selection boards using the “impression” method. This 
gave a gross pre-O.T.U. failure rate of about 40 per cent., when 


10.1.59 8 7 6 5 


HIGH - 
GRADING PERFORMANCE | 
NUMBER: 27 28 29 27 27 27, 28 28 25 26 


ETS, OUTCOME 


NUMBER LED 
0 PASSED FAI 
Por ORs Bus 


Fic. 13. Fricnr Test (R.C.A.F.) SPECIAL GnapiNG EXPERIMENT. 


RAR, schools were in the U.K., rising considerably higher when 
€y went overseas. In the second quarter of 1942 many of the 
Cadets had received a limited amount of casual flying instruction 

efore going overseas; the failure rate then became about 35 per 
cent. In the third quarter all cadets had been through the grading 
Procedure, but draft commitments made it necessary to send 
a all overseas whether they had shown much or little aptitude 
"us flying — under these conditions the subsequent failure rate came 
; WN to 30 per cent. By the fourth quarter of 1942, grading came 
into full operation, only those who had demonstrated a relatively 

Bh degree of flying aptitude going forward. This method of 
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“quality selection” reduced the overseas failure rate abruptly to 
15 per cent. The subsequent increased wastage rates in 1943 were 
the result of an attempt to secure higher quality by failing border- 
line cases who would hitherto have been passed. This policy was 
not successful and counter-measures had later to be taken to 
restore the original standards. f 
Figs. 12 and 13 show the relationship between degrees 0 
grading proficiency and E.F.T.S. performance. Ordinarily no one 
in the lowest three grading groups would be sent forward and com- 
paratively few from No. 4. The results in Fig. 12 are based on 
nearly 15,000 pilots and show a steady diminution in the E.F.T.S. 
failure rate as the higher grading groups are reached. Fig- 18 
records similar data for the only complete grading population sent 
forward; the numbers are small (272 in all), but the increase 12 


failure of Groups 1 and 2 is large enough to command attention. 
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R.A.F. Form 325(1) 
7 
No...E.P.T.S GRADING ASSESSMENT | аа И 
о АРИЙ Вав аха amies 
Instructors 
Teste: For Office Use 


Solo Before ‘Test 


Solo Immediately After Test 
Not Solo 
a 


Total Marks 

Extended by 

Checked by 

Time to Ist Solo 

Time to Test... 


Tuin (qi Tarmac Check 01234 — 678910 ШО» | 
Use of Controls and Eng.. |01234 - 67 B dais RA Er] 
Safety-speed and look-out DEESLGS шшш 
Dm edem. 01534 - 678910/()| | 
Use of elevator 01234 — зас ey о. 
Keeping straight ITIN ee к 
Use ОЁ engine. „|01234 - 678910 Aue | 
AS. & R.P.M. on climb . |01234 - 678910|| | 
Poor commence glide 01234 - твою 
Selection of turning point | 01234 - 67 8 9 10 | el 
Maintaining correct speed. QUIS fared eee S 
Aljusmnentofglide pam. |o 1,8 3,4 > 67 2000 DNE 
pter MN Е ЭЕ MEN a 
Judgment of check height |01234 - 6789 ME 
Handling during hold off |01234 — ERES S — 
Quality of landing attempt | 01 234 - 6789 10 {Oo} 
Subsequent actions . ja MTS Jui Ey Ө — 


Carried Forward 
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oo Brought Forward 
2 Cette coded 01234-6789 10 = |] 
b | Accuracy of turn to bank 01234-678910]@)| | 
€ | Maintaining constant bank . | 012 34—6 7 8 9 10 [0| ҖЕ 
d Coming out w'out skid orsipl01234-678910]|(1) | 9 
ВЕ =н eo} 
b | Accuracy of turn to bank 0 1 2934-6789 10 | (2) МЕ 
с | Maintaining constant bank . | 0 1 234-678910 ДАШ. 
d | Coming out 012:94—078910|0) | ШЖ 
i Cheah Eme v 01234-678910 | @ ЖШ 
b | Use of trons foresa-e7s90|@| d 
€ | Accuracy of turn to bank [o1234-678910|Q)| 
d | Keeping height . [o1234-e78910|Q| 
e | Coming out |[o:zsa4-ev8910|Q| | 
кү е з Н ж ОЬ ыы 
д ко 19 01234-678910 | Пу нш 
b Accuracy of turn to bank , [o12s4-evs910|O| I 
© | Maintaining correct speed [o1234-e7s9109|O| | 
d Coming out at correct speed |01ззя-втвәю|@/| — 
lo | Medium Gliding Turn (L) Pug c o 1 1 
a Decrease speed before tun ./01234-6789 10 | Ole 
b Accuracy of turn to bank , [o1234-678910|O| — 
с Maintaining correct speed o PE TED (2) ШЕ 
d | Coming out at correct speed | 0 1 9 34-6789 10 bese = 
ENS Er ETE S LE p 
ч Posten Cee glide 01234-6789 10 Koen жЕ 
b | Selection 9f turning point 01234-678910 Оке 
^ | Maintaining correct speed 01234-6789 10|(2) ea] 
d Adjustment of glide path 01234-6789 10 | (1) =a 


| Carried Forward 
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Brought Forward 


11 Second Landing 


Selection of landing path 789 10|@) 


4-6 
4-678 9 10 | (2) 
34-6789 10 zl 
6 
6 


Judgment of check height 


EI 


¢ | Handling during hold off 


о|о|ое|ое| е 
=. 
to 

wj ojjj 


Quality of landing attempt . 1234-6789 10 | (6) |] 

* | Subsequent actions 1934-6789 10 | (2) 
=a 

12 | General 

a | Alertness . я ^ .|[01234-678910|(3) 

b | Handling and 

control - 
ordination Я P E 01234-678910/(2) 

с | Style : n .[01234-678910]|(5 
— 


Total Marks 


The constitution of the flight test is most easily conveyed by a 
Study of the scoring card, the original version of which is shown 
‘te. From this it will be seen that the test included twelve sections 
© majority of which correspond to specific flight manceuvres. 
rom three to five features are assessed for each manceuvre, every 
feature being marked on a 10-point scale. Differential weightings 
(1-6) are accorded to each detail, a hundred weighting units in all 
emg distributed among fifty items. 1 
At had been anticipated that, despite all efforts to standardise 
ight test procedure, the actual marks assigned at different schools 
Would show differences which were really due to divergent marking 
ы Fig. 14 demonstrates that this in fact proved to be the case; 
Ж rine Schools assigned the same marks to the same sort of ре 
UAM 1n precisely the same way, the lines joining the quartile 
Would (sampling errors in the basic quality of the groups 
2Рап) have been horizontal. It was also demonstrated that when 
а P marks assigned at any one school were studied over a consider- 
" © Period of time, marking standards again showed considerable 
MM These two sources of instability were combated by the 
roduction of school conversion tables. Each table was based on 
scho] mance of the /ast hundred cases to come through a given 
Scor and involved the use of a rectilinear 20-point scale. Thus a 
€ of ?0 would correspond to a performance within the top 
Boy Cent. of the population whatever the raw score allotted. This 
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ШЕ] — Tm 


HIGHEST THIRD SECOND LOWEST 
E QUARTER D Quarter E QUARTER ЫТ 


CADETS 
Fic. 14. Srcowp Fricur Тєзт Marks, Apri, 1942, First 100 
GnapED AT Each Scuoor, 


a 
Solution was based on the assumption (demonstrated both put. 
priori reasoning and empirically) that the risk of a ws «t 
randomly chosen cadets differing sharply in quality from tha 

the general population was small 


a E d je. à 
Originally equal Weight was given to the two flight tests, 
cadet's grading sc 


weighting. ina 
The reliability of the individual flight tests was measured ! 
le of experiments during which secondary tests, give? 
independent assessors, were introduced into the programme. ; 
first of these following i i n 
test, yielded а test-retest correlation of -765 (360 cases). The oe 
additional test was given midway between the first and sec 
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Ajg rans 


= -/ . "M 
гома Sa 
1.000 CADETS 
OVERSEAS 
Е КОНЕЙ 
CLASSI 
PILOT 
-" Be Ul 
LOW 
HIGH <<$ A w 
NG IN U. 
FLIGHT TEST PERFORMANCE AT GRADI 
Fic, 15, 


LATION О & т ENT 
Ra QUENT ACCIDE 
RELATI F GRADING PERFORMANCE aes p UENT SETS). 
TE IN Рцот Гкліміхс (R.A.F. Grapep Capers THROUG S.F.T 


ex t be claimed. 
flight tests so that perfect test-retest conditions "a and 665. 
* correlations (292 cases) were o a tests were also 
e relationship between the seven and elev TH and -561. None 
calculated on these groups, the figures being : liability of the full 
Of these figures must be identified with the Ке а 
Brading Procedure which, since it then TU 15 (based on 1,000 
now has three, is almost certainly higher. Fig. ding assessments 
cases) shows that cadets who received high a in pilot training, 
su Sequently showed a relatively low accident ees lower grading 
While Subsequent accident rates for candidates у 
ments increased sharply and steadily. and flying failure 
а ORO the pou m It might have 
rates for both graded and ungraded populati ful as grading has 
cen expected that a selection method as success lly levelled the 
Proved in other directions would have ЕА aren 
ailure Tate for those in the different age КОШЫ wae who were 
It will be Seen, however, that those aged 29 an 
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RADED 
RATE OF mb us 
ELEMENTARY GRADED, 
FLYING FAILURE (ear) 


40 


30 


20 


21-24 25:28 29 & OVER 
AGE AT ENTRY 


ТЕ. 
Fic. 16. RELATION ор GRADING то Аск Dirrerences AND FAILURE RA 


accepted for pilot trainin 
` аз high a failure rate as 
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Taste XLI.—Ar Crew APTITUDE Test BATTERY (APRIL, 1944) 


Т sls 
t 2 ЕК 
Test Description p ud Б AS 
1. General Intelli n 
СУК. = p Gen. A Oxford Univ. GENERAL 
2. Educational Attain- GE Маву 
ments , a А еп. атава 
3. Languageand Judgment | Сеп. C PW ud EXPERIMENTAL 
4. Aviation Information . | Gen. D '"SSA AF. x| |x|x 
a eneral Maths. . . | Mat. A USAAF. 
* Mathematical Reason- x|x 22 
іпр * - . | Mat. B ATE a x 
7. Approximations | — ; | Mat. C ue АА х х 
$ peed in Calculation . | Mat. D U.S.A.A. 
+ Element: Calcula- n ec ie 
tions s. S a З . | Mat. E Air M xix x 
n Mil Reading . .| Mat. F U.S.A.A.F. 
* Mechanical Compre- - x хүхіх 
lS ension (Pictures) . | Mec. A pore 0 
^. Mechanical Сотрге- Мес. В Air U $.А.А.Е. |x bis 
13 ension (Diagrams) . from 7.5.3 xix 
14. Турса! Information | Mec. C А.Е. х|х|х|х|х 
15. pial Reading , . | Ins. A 2 
+ Instrument Compre- x x, 
16. A uon К i: is P х|х|х 
“депа otographs . | Obs. 4 х|х|х 
ir Map Reading i . | Obs. B x d 
19. Aircraft Silhouettes . || Obs, C 
+ Pilot Co-ordinator Co-ord. A Heiblishment 
Cc ү, : (R.A.F.)and | 2 х 
20 Cambridge Univ. |x 
Control of Velocit iv. | EX TAL 
М Y EXPERIMEN 
21 ‚(С.У.Т. 7 . | Co-ord. B EE Univ. IxixIx |x 
22. Finger Dexterity ‚ | Co-ord. С | U.S.A.A.F. EXPERIMENTAL 
23. fret Manipulation „| Co-ord. D | Air Mapy 
* Morse ecord , - | Morse A | U.S.Navy x 
(modified) ! 


p Table XLI shows the content of the two-day testing БОЕ T 
Mtroduced in April, 1944 to yield aptitude measures n s йа 
н pr Crew categories. The columns on the right show ed 
originally used to elicit these category indices. The p ж 
5 clear the extent of our indebtedness to the p uS 
ony Air Force, Two of the experimental tests ( E т. 
Co-ord, B) have since been validated and introduced into t A Е 
: For details of length, timing and reliability the re 
© Appendix II. 
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TABLE XLII.—Vaurpation or CATEGORY INDICES 


TE Correlation 
бе Significant 
Category Criterion No. (01 pie ai 


Uncorrected | Corrected 


PILOT Grading 923 "10 :262 474 
Initial (Ground) - 473 
"Training 811 104 :285 
No Basic Training Results to hand 
NAVIGA- | Initial Training 609 *I9 4715 636 
TOR Basic Training 294 "18 511 
(a +99) 
AIR Initial Training | 999 20 169 Ё Жа, 
BOMBER No tested subject came through the basic course of this n 
obsolete trade. 
— M1 
WIRELESS 
OPERATOR 


No meaningful training criterion. 


FLIGHT Initial Training 


502 
702 ul 487 E 
ENGINEER | Technical +280 
Training (Phase 1) | 437 14 197 
Technical +326 
Training (Phase 2) | 396 15 :224 EX 
E | eee —————4 *, 
AIR Initial Training 473 14 `365 | gE 
GUNNER 


o Basic Training Results to hand. 


. h 
To reach the different Category indices the raw scores for eac 
test were converted on toa 9- 


MAIN R.A.F. SELECTION FINDINGS 275 


single exception is of no great concern as the navigator index was _ 
always applied as a preliminary screen for the air bomber trade 
and the uncorrected figure yielded by this index for the same 
Population exceeds the critical point. M 

It will further be noted that the figures for the navigator category 
are considerably greater than for any other. This is probably 
because intelligence and educational attainment are highly relevant 
9r navigators whereas these general abilities (granted a certam 
Minimum) аге of relatively little account elsewhere. Where this 
latter situation occurs low but significant correlations for tests of a 
More specific character naturally acquire an importance they 
cannot otherwise claim. 

Lack of staff has hitherto made it impossible to carry out much 
‘velopment work both on the tests individually and the battery 
85 а whole. Thus the size of the battery still suggests something in 
ты eXperimental stage, and there is little doubt that it could be 
Teduced to 4 compacter form if more labour could be devoted E 
ЕСШЕ. The same limitation has prevented many more detaile 
'nvestigations particularly in the field of item analyses. 


'ТАвгЕ XLIII—PockeT CLASSIFICATION 


Nav. ИО F|E AIG 


Breen 55 22 137 E 149 A 
ф Bown. ` 1% Ке R cei р 
“ш e| 96] A | 105 B [|11] C | 109 р 
“шы, 1| 09 B 79 | A 98 | C | 102 
Table XLIII is intended to give an idea of how testing results 
Were fj 


nally interpreted. A classification involving only four people 
т шага (weekly intakes in 1944 frequently exceeded 1000). 
aptit наш Principle has always been: “Preference ae ^ 
et > € and Service needs permit.” In this case Service nee d 
еп n en to be for one navigator, one wireless operator, 7) { ME 
(m cer and one air gunner. Preferences are indicated by le а 
Spit tst choice, B second, etc.) and indices by numbers. быш 
Vista St ae very high navigator index, will become the dp 
E of his Preference for this category. Brown with an d 
18 Ded а high index will become the navigator. Grey's ар M 
oi NA for his A preference, but just adequate for his rs 
omes the Wireless operator. The berths for White's А ani 
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t ; ngi- 
preferences are already filled and his aptitude for n: M 
neer) is just adequate (had it been much lower nfilled) A сое 
necessary to recommend leaving this berth unfilled). ading con- 
cation arose in the classification of pilots for whom gr deerat 
stitutes a secondary screen. This was met by giving vhs тайган 
classification to which approximately the lower half on the g 
tests reverted, the 
So far comparatively little work has been а ie 
problem of personality assessment for air crew. bran шге. E 
war little could be done to standardise interviewing proce tionnaire 
late in 1945 a first step was taken with the issue of a erie 7 aan 
to 362 highly experienced air crew representing all ot EC 
Categories. The purpose of this was to find (a) what = p zm 
agreement existed as to thé relevance of different trai impor- 
crew, (b) whether there was any correspondence between 


A oole 
tance and assessability at interview, Table XLIV gives the p 
result of the 362 orders. 


TABLE XLIV.—PznsoNALITY AND CHARACTER 
"TRArTs DEEMED Important IN AIR CREW 


(Pooling of Orders in 362 


uestionnaires) 
l. Calmness ie Аш а oe 1243 
2. Dependability |; 7 3y .. 1502 
3. Determination `. E ká ., 2847 
1. Initiative 1920 
5. Keenness, . 2252 
6. Confidence б t E m 2290 
7. Co-operation `. x ps si 2350 
8. Discipline (= Self-discipline) .. |] 9497 
9. ecisiveness УА т " .. 2498 
10. Aggressiveness ` А 5. a, we 2923 
11. Respect and Influence |. she .. 4082 
12. Sense of Humour E 7 .. 4977 
13. Powers of Self-Expression з m кода 
14, Acceptability 5h d? i .. 4692 
15. Appearance and Bearing yt .. 4728 
16. Breadth of Outlook tg “ТОМ 7675398 e 
Н 3 : tan 
This Table Shows first а sharp cleavage in believed impor 


between traits 10 and 11, a 
the traits most easily asses 
The problem now takes t 
standard interview in 
important are to be rated, 
table interview and sub 


ple 
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Boop. б. 8 and 10 were omitted after prolonged discussion, 

“Жозе us Attitude to others," was introduced as a result 

reliability m by those answering the questionnaires.) "Two 

c MM paw were then undertaken (1) between the 

(2) betw „ОЁ interviewer and observer at a single interview, 
een interviewers at successive interviews. 


TABLE XLV.—RzrrABILITY OF TRAIT ASSESSMENTS 


Independent assessments 
Trai Independent at successive interviews 
cast assessments at a = ЕЙ 
one interview Interviewer A v. 
Interviewer 
Calmness . (N — 54) (N = 87) 
ОНЕ Жо аа xus 46 :33 
obendability j : . s “72 38 
Ашорегачоп 28 
itude to others 5 00 
egisiveness E 43 
terminati х à z * x F 
ithe Е К ‘44 49 
eenness | ^ . . A -58 50 
3 ` . Ў ал `29 
The 


The ee in both experiments were unavoidably very small. 
In the first m the whole conform to expectation, the coefficients 
Parts in the 5 umn being in each case higher than their counter- 
Optimism DUE 'The outcome of these findings 1$ а restrained 
development out the new type of interview, on W ch much 
Points sh work remains to be done. Meantime the following 
(1) Should be noted: 
eee iewer undertakes the new approach without pro- 
(2) No riefing and practice. 
Cd is automatically accepted or rejected on the new . 
(3) A ew. 

Бис the introduction of a new rating scale based on a 
(ОБ 29 analysis, the new interview differs from the old, 
z Tu ti conducted individually, (ii) in making a more 
оп а approach (e.g. the assessment of qualities is based 
didat en appraisal of character rather than the can- 
Iti s attitude to Service life). : 
to d ua that the listing of character traits may appear 

our of atomism. The importance of considering the 


(4) 
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; ideri sonality 
relationship between traits and of AE з 
as a pattern is, however, strongly underlined in 
of interviewers. 


Selection for Ground Trades m. 
A battery of three intelligence tests (G.V.K.) E n sind 
arithmetic test has been given to the full ground popu 


С YR d as а 
1942. The combined G.V.K. scores E EY +E) are use 


. ides 
broad indicator of quality. Table XLVI is eiae d mE, 
originally given to interviewers to assist them in eae d into 
scores. The trades (some of them now obsolete) are divi oup ie 
Six groups on a “general calibre” basis, Against each i ге” аге 
average G.V.K. levels for “assured success” and “failu 
quoted, the columns on t 
(L.A.C. or A.C.1), 
trade training result 
of success above th 
the failure level on 
three percentiles Ьа. 

When the intervi 
grouping a study o 
delicate business 
The most obviou. 


Separation of High V. T, 
former sub 


ес ч S 
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C rog R.A.F. GROUN! 
Taste XLVI.—Snowinc Averace G.V.K. SCORES 


ii) 


(iv) 


‘TRADES S 
= = ining results, show- 
Average | Average | level who biam {Ле 
С.Ё.К. G.V.K. training results: 
Trade level ‘Score Е и 
- or 4.0.1 |4.C.2 | С.Т. 
[so | 0 
80 20 
Meteorologist 200. | ‘assured 80 " 5 
Radio Wireless Mechanic | success. 60 0 от 
R.D.F. Wireless Mechanic | ‘failure ONT 2 
6: 
Photographer 11 ‘assured 10 
Clerk Special Duties success 
Clerk General Duties — 
ARG Direction Finding 
P 5 33 
Clerk Accounting а. 40 12 55 
ink Trainer Instructor | ‘failure 
ireless Op. YET 
W./T. (Slip Reader) Op. rei Б 0 
Кее me = 5 
Flight Mechanic II ‘assured 60 
p ourer (Bomb or Gun) | success 
I 
Instrument Repairer II D 31 
Radio Telephone Op. "v 30 2:5 | 0675 
Electrician II *failure 
Armoured Car Crew 
Physical Training Instr. Миз. > 
eleprinter Ор. == EUH | 18 
Motor Boat Crew ‘assured 50 
ete Assistant " success 
Otor Transport Mech. 45 
otor Transport Driver SAM 20 0 55 
Ground Observer ‘failure’ 2 
егуісе Police 
edical Orderly 
‘clephone Op. | c m 
Orpedoman BE БЕЕЕНЕНЕЕ 
Ground Gunner 
ie Worker 
Ook and Butcher K. of less 
rounds f erage СУК. 
ue" S Failure score E Е 
atman 
Balloon Op. 
irerafthand General 
uties , 
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Ч <. Scores AND 
Taste XLVII.—MULTIPLE CORRELATIONS BETWEEN G.V.K. 
i TRAINING RESULTS FoR A VaRIETY оғ R.A.F. AND W.A.A.F. TRADES 


‚А.Е. WAAL. 
ма r Multiple 
Trade after 
before after before : 
а СОЛА m m correction | correction 
Radio Wireless Mechanic ; А `64 `76 e -58 
Radar Operator А i 1 33 -56 
Radio Telephone Operator . š `61 `68 ^90 
Clerk Provisioning | . Р > “49 '58 
Equipment Assistant а P "40 "54 " Бе 
Wireless Operator 3 К x *51 — 743 +56 
Wireless Telephone Slip Reader. RET ш = `37 48 
Telephone Operator . 3 ; *87 “50 "36 
Flight Mechanic (Airframes) à "36 ‘47 
Flight Mechanic (Engines) Ы *37 “45 -50 
Photographer . Á T © -29 ‘44 "46 -43 
Carpenter II -33 “40 E -52 
lectrician II "94 "36 “34 


Aircraft Finisher -26 -32 


This Table makes clear that for certain high-grade trades G.V.K. 
alone can offer e 

intended that it should do more than supply pointers to the trade 
allocation proble i 


vis-à-vis demonstrated aptitude. 
(2) Intensive briefing of interviewers as to the reliance that me 
be placed on the results of amore elaborate testing procedure 


z к : ice 
(3) Separate consideration of the volunteer and National Servic 
entrants. 


CHAPTER XVII 
CONCLUSIONS 


эң, series of brief statements. summarises the Eu 

E d А e оп personnel selection work in the Forces whic we 

time Th; aving vocational or educational applications P t 

08, E ЗА are put forward somewhat dogmatically, ш ч 

evide pae by.references to pages in the text where the relevan 
nce or discussion has been presented. 

Vocational Psychology 


probea chology has many applications to contemporary ы 
of childs, It has proved itself particularly useful in the pre т оп 
(11-12) ren’s and adults’ educational and vocational capabilities 
ee here is little to be learned from German ee 
Cations ла the 1930s, except from its mistakes. But the eR S 

Methods i the technical development of psychological selectio 
t the s. E America are generally more extensive than in Ваш 
Psycho] e time there are several aspects of American hr 
Extreme e which should be viewed with reserve; for comp E ы 

respon: emphasis on tests, the almost universal use of selectiv 
Se test items, the analysis of abilities into independent 


actor: 
96, | уч of clinical psychologists, etc. (13-24, 


than mee greater advances of applied psychology in хана 
"RR are due partly to the greater resources of тед а 
held p da also to the greater prestige in which psycho ogy i 
5 Popul overnment and Service authorities and by the pu п 
Of its wa arity here has now much increased (probably asa Miel 
PSychoj, пе achievements), and there are far too few qu b 

` Service 81905 to meet the demands from industry, education, 
n 55 the universities, etc. (18, 23-24, 100-102). 
j кше and Other Considerations 
агыш, he introduction of psychological methods does not ies 
“ТУ or bureaucratic direction of human beings- On the 
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contrary psychologists stress the individuality of character, wee 
and interests, and try to arrive at recommendations which will bes: 
suit both their subjects or candidates and the prospective employers 
or teachers (11-12, 85-89). 


2.2. Psychologists do not wish for executive authority in Carry 1 
ing out selection or other schemes. They prefer to act as ec 
advisers, who provide the methods and train the personnel ӨШ 
or teachers to apply them, but who leave the final decisions to wi 
users. They regard it as unpsychological to impose a ready-ma е 
scheme on ап industrial firm, an educational authority or ап 
institution like the Army. They are willing to educate the e 
gradually, and to prove the worth of their scientific methods by 
the results they obtain, as they go along. At the same time they can. 
hardly be expected to do their best work if their advice is соя 
stantly over-ruled, their methods misapplied, and if they are F. 
allowed to administer their own affairs (27-29, 41-42, 54—59, 
92-94, 100-102). 

2.3. Vocational and 


educational psychologists have shown them- 
selves capable of asse. 


Ssing social and emotional factors in Ше 

- child or adult subjects, in addition to abilities, without having iD 
refer any but seriously maladjusted cases to persons with psychia- 
tric training. Nevertheless, so many selection, and other industr!? 
and educational, problems involve the deeper, unconscious, factis 
in human motivation that Co-operation between the psychologicé 
and psychiatric approaches is desirable. Another important cona 
sideration in deciding the rôle of these two professions is thg 
relative acceptability to the users (20-21, 33, 58-60, 93-94). 


Broader Psychological Aspects of Vocational Classification 

, 9-1. Vocational or educational classification schemes should 0207 
sider the needs of the institution (industrial firm, Army, etc.) a 
of its members as a whole. Vocational selection which mere y 


attempts to pick the best men for one job, regardless of the needs 


of other branches of the instituti ly rejected Сай” 
didates, nstitution, or of wrongfully rej ple 


c and guidance which merely recommends the most m 
jobs for single individuals, are patchwork measures which may ne b 
allyincrease vocational maladjustment in the long run (85-89,92- 


А : jon 
3.2. 'The value Of vocational and educational classificat 
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e pe selection leads to lack of confidence between 
antl cm employees, soldiers and officers, parents and schools, 
Opposite хе neurosis. Selection that appears good has the 
ле з s even if, judged by scientific standards, it is far from 
individual's шыт good it must include consideration of each 
and the E erests and abilities by a sympathetic interviewer, 
relevant to th ox pi employed must be obviously 
33. The sis (26-27, 40, 45, 52, 65-66, 97, 255). 
Important (ho of the candidates towards a scheme are 
and their Franke, ecause their performance in psychological tests 
is unsafe to assu, ess in questionnaires or interviews are affected. It 
Would be Silly « that tests which worked well in, say, the Army, 
Uplicateq in T effective, or that the results obtained would be 
Motives and o п industrial or educational context where the 
Particularly en of the candidates differ. While this applies 
Not likely tom personality factors, assessments of abilities also are 
tion Obtained oni immune. The confidential nature of infor- 
Tespected (65, 99 rom tests, questionnaires or interviews should be 
Connect; » 92-94, 100-102, 218, 257, 260). 
io 
41 MA Classification and Training 
hours? t н 1 classification may often have to be based on a few 
Section of der nd interviewing of the subjects, i.e. on а cross- 
Stated wit кра and abilities, it should preferably be inte- 
a fige qi 239 longitudinal process. This has the addi- 
Mality of the ay il at the training given can be adapted to the 
dopo. ОН De tramees (92, 97—98). 
E Simpli cation. ers nd training are closely bound up with 
muc креп, сап A tige чаша] psychologist, or time and motion 
trains arger proportion “A reduce the complexity of a job that a 
as (98). candidates can manage it with simpler 
* 5e-classi : 
mands Jue ed subjects who have failed in a first job 
hae? The comm, y skilful handling because of the blow to their 
Idual, and Gees that there is a right job for each 
a dig. Vill make hi if he fails in one thing he possesses abilities 
iffe him successful at anoth ll lex job of 
TAEA ү. other equally complex job о 
mend a lowe in some cases. More often it is neces- 
l will b r-grade employment whose demands on 
€ smaller (32-33, 48-49, 149). 


. У Teco 
€ ing; m; 
^ Widual 
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Further Characteristics of Classification Schemes "m... 
5.1. A well-devised classification scheme ааа 
the supply of candidates falling short of, or SS. exist 
of jobs. It^should also not usually involve great po) Mn 
of the time of highly-qualified personnel, or in t 2 р 
elaborate tests and very detailed job analyses (85-89). anne 
5.2. At the same time effective classification schemes eres Er. 
run by amateur psychologists. The planning, ed nds, tie 
documentation, the proper choice of testing or other me dins 
interpretation of their results, and in particular Шеш ner. 
the worth of the methods, are inevitably highly technica 
(Chapters VI, VII, X). 


Proving the Value of Classification Methods . datum 
6.1. No selection method (e.g. a test or examination), ип impres- 
ed in selection (e.g. an item from previous history, or e sive © 
sion made in an interview) should be regarded as pre i roof, 
vocational or educational suitability without experimenta ү: E 
That a method works well in the “experience” of the изе tudes 
Seems to him to select suitable candidates shows that his SR that 
to the method are favourable, but does not prove scientifically 
it is a valid instrument (99-100, 119-120, 124). kona 
6.2. Tt is not worth the psychologist’s while to er. 
classification Scheme unless he can be assured of the availabt the 
f his 


us 


competence at some 
criteria can be mu 
in the Forces (105-112), 


. . enon 
6.3. It is seldom a straightforward or simple matter to Е of its 
strate the value of a classificati zu 9 


whose Proficiency at the 


7 work can be assessed. Owing to 
factor, it is More difficul 


ure 
t to prove the value of a proced 


CONCLUSIONS 285 


More efficient that procedure is; also, any method or datum 
actually used in selecting is liable to appear less valid than methods 
Not already used (107-108, 111-112, 113-117, 122-128). 

E^ Very big numbers of subjects are essential for most investi- 
o into the value of classification methods. Several parallel 
of Sr moderate-sized groups are preferable to a single study 
liable t Sg group, because the results from different groups are 

$5. у Уау so widely (112-113). 
Benet he of all these difficulties the value of classification 
With such ree pe beyond all doubt not only in connection 

Ut also among RA P. Army and АЛГ 5. mechanics and clerks, 
appreciable success .F. aircrew. Less striking, but none the less 
Сап be fey jobs th S, was demonstrated in officer selection. There 
these, and few, th at call for more complex traits and abilities than 
0 ring about i erefore, where psychological methods are unlikely 

mproved selection (119-127, V, XVI). 


The St 
1 aff Needed in Ci lassification Schemes 


be effect greater part of the day-to-day classification work can 

Personnel offi out by non-psychologists such as teachers 
i q Чайїпр but cials. They require not merely careful selection 
n the Capaciti Constant supervision. So great are the variations 
be followed ù с of such persons that the value of their work should 
fields ag Soo aly. Women are at least as suitable in most 
tp ological equal intelligence and education, even when the 

, 45, 


l work is lar elv c А 
oncern 27-29, 
ae рем, 100-102, oa” with adult men (27 
79 P tog nes 9f these persons is best done by apprentice- 
PR Burt Psychologist. Lectures and reading can play 
an оф, an = actice in the actual application of psychological 
Au essential, W ee tests of skill before they work on their own, 
iat ling more kai that largely involves interviewing requires if 
Tation (27-9 raining than work consisting chiefly of test admin- 
9, 41, 44-45, 73-74, 100-102). 


Or oy, US { : 

dr к: Paper КУ constitute an invaluable element inaselection, 

of bs handling bya NAE because any human affair requires 

Videq 22 intuition saan being. But so great is the fallibility 

them With as accurat, commonsense” that they should be pro- 

D Preferen, € tools as possible, and encouraged to use 
Ce to their hunches. Only the exceptional human 

\ 
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ienti idated 
being has acumen superior to impersonal, scientifically val д 
procedures (99-100, 152-164). 


Provision of Information About Jobs M i 
8.1. If ee are supplied with accurate and ei p. à 
information about available jobs, many of them oa Bs inter- 
large measure of self-guidance. This not only а of the 
viewer’s time but provides him or her with an indi iven infor- 
keenness of their interests. Interviewers also must be g 
mation in a standard, usable form (31, 46, 90-91). г as a basis 
8.2. The job descriptions needed for this purpose, d industrial 
for test construction, should be prepared by exper SUME relevant 
psychologists in collaboration, and should cover a rations per- 
physical and social conditions—not merely the operz s referring 
formed. As little use as possible should be made of stn and the 
to generalised traits or abilities (concentration, dexterity, 
like) (47, 49, 90-91, 92—93, 167,173). ! 
Collection of Biographical Data 
9.1. The central feature of clas 
graphical questionnaire or 
relevant information about 
judgments and recommen: 


^ io- 
Sification procedure 15 A ie: 
cumulative record form prm 
previous history, test scores, "Precau- 
dations, are brought together. 35 


ПСЕ \ А 18 
tions in drawing up such a form are summarised on РР 
(29, 43, 91-99), 


9.2. Such items as 
tion, evidence of lea 
and interests in par 
dicting occupationa 
difficult to elicit 
Written questi 


interview or, etter, measured objectively by suitable tests 
135-142), 


f duca" 
previous occupational experience, wo 
dership and of a responsible attitude in pre 
ticular fields, often have greater era itis 
1 success than aptitude tests. As, how ters 
reliable information about these ma 


n 
dupi 
onnaire alone, they must either be checke (95-96 


Interviewing 


ent OF 
10.1. Conclusions relating to the conduct of the EYE also 
diagnostic intery; i 
31-32, 96-97, 146-152), 
' 10.2. Different intervi 
arrive at widely differing 
thoroughly trained and 


to 
nd 

e fou 
€wers of the same candidates € ally 
conclusions, unless they are ex tion 

have a clear and agreed concep 


CONCLUSIONS 287 


What they are looking for. The interview judgments of psycho- 
logists and psychiatrists may have considerable value, though even 
Сезе are very variable. But the average interviewer's summing up 
of past history and his judgments of personality qualities tend to 
be so Subjective that they detract from, rather than add to, the 
accuracy of predictions based on properly weighted combinations 
of test scores and other objective data (152-164). 

10.3. At the same time this “psychometric” approach to the 
NE nent of vocational suitability tends to be too rigid d 
Erie cal, and may require immense technical pw x. E 
alter E Where the numbers are small, or where бте АШ 
itm apidly, it is quite impracticable. Its impersonality also : 

uch less acceptable. The interview is, therefore, essential on the 


Srounds of flexibili ity, i ite of its inaccuracy 
:xibility and humanity, in spite of its 10а 
(94-97, 163 16 4), у y, in sp 


Р ' 

p tical Tests and Test Standards e 
Value > What a psychological test appears to measure (its ae 
15 important from the standpoint of acceptability, bu 
Pus light on what it actually does measure. Tests пош 
should regarded as eliciting hypothetical traits or ангы 
and their directly related to job proficiencies by follow-up res Es 
acto, Content investigated by analysis of the main com 


tS running through them and other tests (164-168). 


xe Absolute Standards or minimum (critical) test „scores 
of schoo ү Бе laid down for acceptance for a job, or a certain type 
| 11, Since the state of supply and demand must be taken 


9 acco А 

Never be als The psychologist does not say that this man could 

E à low pr irm. at A and such work, but that there s А high 
ow УО т ss. F , i i 

Bron, Ё Optimum, у 5 success. Follow-up information should 


Consi OF tests, and mo. ОЁ scores on the most relevant tests or 
T “ration (85-89 Кз ta such as age, for each job under 
Ides of > 211—180, 278-279). 
Те 
ae TA wia ost Suitable for Classification 
tne Verba di battery of group paper-and-pencil tests, con- 
W Ta good deal “grammatic or pictorial material, will by itself 
5 Sts He of the ground in vocational classification schemes. 
lable, aA ormation and trade knowledge type should also be 
9r certain jobs supplementary analogous or 


throws 


ES 
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“miniature situation” tests are desirable. But our oie 
cates that it is not usually necessary to devise an ные үн -€— 
of practical tests for measuring the aptitudes presume to cathe 
lie each job, except in applying selection schemes 
homogeneous samples of candidates (168-172, XII). -—-. 

12.2. In constructing a new test, both the test as a whole pu 

` component items should be shown to contribute to the s FÉ 

of vocational suitability over and above the standar 4 dud 
Economic considerations such as availability of materials an we EL 
personnel for constructing, applying and scoring, etc., ™ 
weighed against this contribution (172-176, 243). ET 

12.3. Tests must be adapted to the population concen ian 
example, creative-response items are preferable to 5 cnr 
response in this country, and time limits should usually S Б tions 
ous. Tests of the abstraction and clerical type, and oral pr. 
(provided delivery can be standardised) appear to wor du 
among average and sub-average adults than the more Medie nol 
analogies, classification, reasoning problems, etc. Tests shou short 
be too numerous, and most of them can and shouldbe quite 
—not more than fifteen minutes (170, 175, 220-224). 


The Value of Certain Tests of Abilities 


eas 

13.1. Objective spelling tests tend to be more reliable and Short 

valid as dictation tests of the same length, among adults, ative 

answer arithmetic and mathematics tests have wider 926-229). 
value than lengthy arithmetical reasoning examinations (22 


; ical type 
13.2. Such mathematics tests and verbal tests of the cleric exte 

measure intelligence and educability in adults to a кен uselu 
than actual education. Hence they were found to be the mos 


Е ilit 
tests of any in most Army and Navy jobs where rapid tr ginabi 
and adaptability were needed (213-219, 226-229). tests, 

13.3. Mechanical comprehension, models and assembly ore 
tests of spatial judgment and performance tests, tend to iability 
general practical intelligence in adult males rather than SU! 


e 
Н n 1 are ds 
for specifically mechanical occupations; (information tests 


ul 
s І сез5! 
most predictive for this Purpose.) Such tests are more SU 


nical exper 
among adolescents and women, whose previous mecha 
ence is less extensive and varied (236-246). 


13.4. A combination of Such mechanical and spatia 


| tests with 
verbal and educational tests provides a better measure 0 


all-roun 


9 
CONCLUSIONS 28 


TW " intel- 
adult potentiality than do tests which aim to measure pure inte 
ligence by non-verbal material (206, 234-236). 


Factors 1 nfluencing Test Performance 


14.1. "Test performance is much affected by the recency of 
Schooling and by the amount of intellectual exercise people get in 
their work, Thus practical abilities tend to rise during eigen 

"t educational ones sink markedly except among Re hes 
Teceive further education, or who are engaged in intellectua jo : 3 

eneral’ intelligence also, as measured by psychological к 
Starts to decline even before the age of 20 among those who 
little “use” of their “brains” (188-195). 

14.2, Neither menstruation nor colds were found to have wid 
{мец effects on the test performances of women, thoug 

idence was obtained of improvement on certain intelligence tests 


i i ining courses 
(201-293 Poor physique as a result of physical training 


Cent, (5 T average rise in psychological test scores of d d 
testees fro, ^ POInts) is likely to occur among unsophi rone 
9f other ^ m having taken the same test before. Previous eel 
pear 4 ests and general schooling or the taking of Mo 
Practice ° Produce smaller, but appreciable, d oc ан 
fore ox Produces Progressively smaller effects. Hence E а giving 

etwas reises Ог one ог two preliminary tests, the differ pe 
be E testes with different amounts of previous ой oe P 
Bot dimi reduced, So far as the evidence goes, practice effect R 

а lapse of time. They are smaller in straig 

tests wih StS With ample time limits than in selective-response 
that the qo mPlicated instructions and unusual content. It follows 


angers of leakages of tests, or of previous coaching on 
© very considerable (182-187). 


alit ies 


Ssessm 
ent of Personality Qu 


m H . 
оць аг Office Selection Boards have shown the superiority of 
Шеге Stu y of can 


AS didates by several trained judges to ordinary 
cedure is Methods of selection, and this aspect of W.O.S.B. pro- 
stratiy dud applying to the selection of managers, admin- 
шей; at Servants, and in other high-grade occupations. The 
Taking “xercises, analogous to the job for which selection is 
Recess т 1129 helps to create confidence in the scheme, but does not, 

"prove its Scientific worth. Observations of groups of 
о 


Pp. 
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candidates performing these exercises, and judgments of person- 
ality based on them, may be fully as subjective and unreliable as 
interview diagnoses. Far more investigation is needed into the best 
way of standardising the methods and making them more objective 
(22-23, 52-66, 122-127). Е 

15.2. Though a definite beginning has been madein the objective 
measurement of important personality factors such as stability v- 
neuroticism, and extraversion v. introversion, most of the tests are 
too elaborate dnd time-consuming for large-scale application. 
Useful resülts are nevertheless obtainable with simple personality 
and interest questionnaire tests, provided that they are carefully 
devised in the light of the attitudes of the testees (251-254, 
256-260). 

15.3. Personality tests of the projection type, апа indirect тез 
based on observation of reactions to stress, may give useful indies 
tions to trained psychologists, but are too subjective for genera 
use. Objectively scored projection tests deserve further exploration. 
Among the most valuable measures of suitability for promotion tO 


posts of responsibility and leadership are ratings or “nominations 
by a man’s fellow-workers (254-257), 


APPENDIX I 
ABBREVIATIONS EMPLOYED IN THE TEXT OR IN 
APPENDIX II 


Eo. Anti-aircraft (Army) 

Dar i Air Crew Reception Centre 

AS C. - A Aviation Candidates Selection Board 

А T S. Army Selection Centre _ 
ic Auxiliary Territorial Service 

Royal Army Corps 
ede P.S. Personnel сув зи, Clerks (К.А.Е.) 
P. Commanding Officer 


(Army), now Women’s” 


С. Е; 

С RO Chief Petty Officer ' 

csc. Combined Recruiting Centre 

CW ў Combined Selection Centre (К.А.Е.) 

Dsp Commission and Warrant (naval eiie 


E E. Р Directorate for Selection of Person 
EFT Entry Establishment (naval recruits) 
8. Elementary Flying Training School 


E; : 

PA Emergency Medical Service т 

з ЖаШ Extended Service Commission (RA. ) 

G.S General intellectual factor in ability 

Gy K General Service (Army) ү; 

ЕСУ „А.Е. Tests of g, v and the 
о; Hostilities Only (recruits to the Navy ho. 

LH Wartime Conscription Act) 

LTW - — Industrial Health Research Board as) 

АА ee Initial Training Wing (RAF. recruits) , 

him Visuo-spatial factor in ability 

M. ' Spatial-mechanical factor lection 
Аб Military Testing Officer (War Office Sele 

Nip Boards) а Psychology 

Nga National Institute of Industria y Ar the post-war 


National Service Act (recruits U 
Ос conscription Act) 3 
д: Officer Cadet Training Unit 
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O.LR. 
O.T.U. 
P.S.O. 
DAC. 


Officer Intelligence Rating 

Operational Training Unit (R.A.F.) 

Personnel Selection Officer 

Primary Training Centre (army recruits) 

Correlation coefficient 

Royal Armoured Corps 

Royal Air Force Selection Board (officers) 

Royal Army Medical Corps 

Royal Army Service Corps 

Royal Canadian Air Force 

Royal Engineers 

Royal Electrical and Mechanical Engineers 

Royal Naval Volunteer Reserve y 

Research and Training Centre (War Office Selection 
Boards) 

Service Flying Training School 

Selection Group or Grade (psychological test stan- 
dard) 

Senior Psychologist to the Admiralty 

Senior Psychologist, or Selection of Personnel, Test 

Total score on standard battery of naval tests 

Training Recommendation (army recruits) 


- United States Army Air Force 


Verbal factor in ability 

Verbal and educational factor ] 

Women's Auxiliary Air Force, now Women's Roya 
Air Force 

War Office Selection Board (officers) 

Women's Royal Naval Service 


APPENDIX II 


THE MAIN PSYCHOLOGICAL TESTS USED IN THE 
FORCES 


s or Abbreviated Title. Different series of numbers were 
E. by Admiralty and War Office psychologists for most of their 
list A but as these seldom conflicted the naval and army tests are 
i ed together below. R.A.F. tests, to which short titles were 
E pe {ш at the end. 
нен = a the A.T.S., 
F—the R.A.F. in the Royal Navy, A—the Army, 
fr e This column gives the working time, in minutes, 
us Instructions, 
Bir . These coefficients are mostly based on retesting after 
tion i months, and refer to representative samples of the popula- 
usually the case of naval and army tests. R.A.F. samples wert 
by th ae highly selected. Italicised coefficients were obtained 
€ Kuder-Richardson or the split-half technique. 
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72-3, 281-2, 284 
in industry, 14, 16-7, 19, 29, 86-9, 
92-4, 97-102, 106-8, 112-3, 115, 
127, 135, 169-72, 205-10, 218 
9, 260, 281-90 
matching supply and demand, 26, 
32, 44, 47, 86-9, 177, 284, 287: 
see also Selection 
Personnel Selection Officers, 116, 137, 
149, 162, 172, 177-80, 252, 285 
Army and A.T.S. 41-50, 101, 
120-2, 131, 150-1, 156-7, 200, 
225, 251, 256 
naval, 27-35, 101, 121, 155-9, 199, 
246 ! 
Philpott, 5. J. Е., 40 
Ebyzical seyelopment centres (Army), 


Pilot grading, 74-8, 98, 262-72, 274- 
6, 301; see also Selection of air- 
crew 

Pioneer Corps, 44-5 

Piotrowski, Z. A., 209, 308 

Polansky, N. A., 154, 308 

Pond, M., 208, 308 

Pre-Service organisations, 131, 136-7, 
140, 199-201 

Presidents of W.O.S.B.s, 41-2, 53-5, 

‚ 64, 126, 100 

Pressey, S. L., 258 

Price, E. J. J., 238, 308 

Prisoners of war, 49, 63-4, 73 n. 

Project method, 90 

Psychiatrists, in Army selection, 40, 
45, 47, 49, 151, 161, 225, 239, 
251, 256-7 

in child guidance, 20 
in payal selection, 31, 33, 150, 155, 


in personnel selection, 153, 282, 
287 


in R.A.F. selection, 151, 156, 161-2 
in U.S. Coastguard selection, 153 
in W.O.S.B.s, 53, 55-60, 62, 64, 
126, 159-61 
suspicions of, 33, 54, 58, 282 
Hey chounalysis, 16, 33, 58-9, 73, 93-4, 
Д 


Psychologists аё W.O.S.B.s, 54-8, 
109-11, 126, 159-60 


Psychology, suspicions of, 11, 38, 42, 
54-6, 95, 257, 281-2 


, 


éi 


INDEX 


Public opinion surveys, 24, 38, 144, 
152, 154 


Qualification form, 43, 46, 131-3, 138- 
9, 150 Б M: 
uestionnaires. iographical, 91-7, 
S “06-7, 131-42, Lid. 149, 162-3, 
169, 171, 173, 210, 283, 286; see 
also Qualification form 
follow-up, 109-10 А 
naval, 29, 31, 35, 131, 136, 138, 
140-2, 150, 159, 210° 
personality, see Personality inven- 
tories 
R.A.F., 77 am 
scoring of, 95, 100, 112, 135-42, 
153-5, 162-4, 174, 284, 286 
W.O.S. 87, 134 
Quetelet, L. A. J., 15 


R.A.F. College, Cranwell, 80 

R.A.F. Selection Boards, 80, 296 

R.A.F. Selection Centres, 78-9 

Rapaport, D., 20, 

Raines for proficiency, 48, 107, 1087 
11, 213, 216—7, 267-9, 284; see 
also Merit ratings; Personality 

Raven, J. C., 225, 234, 236, 308-9 

Reaction time, 14; see also "Tests. 1 

Recruiting Centres, selection at, EB 
29-30, 40, 43, 47, 77-8, 137, 1 Ji 
155, 157, 184, 186, 197 n., 198, 
242, 996, 298-9 

Rees, J. R., 45, 309 

Rehabilitation, 49, 64, 95 

Reid, D. D., 162 15 

Reliability of examinations, 14-19 
18, 80, 98, 11, 213 n. i5 

follow-up criteria, 105-7, nr 
123, 125, 174, 9205, 208, 217, 
248, 284 P 

interview judgments, 152-3, 155-6, 
162, 277, 990. Mv 

questionnaire items, 

tests, 47, 58, 105, 166, 170, iy 
201, 215, 226-9, 231, 235-6, 200. 
243, 245, 247, 249-50, 254, 209, 
270-1, 273, 288, 295-301. 

W.O.S.B. procedures, 63, 1%) 
125-7 4-1 

Research and Training Centre, 0477; 
60-1, 63-4, 122, 297 ' 

Research in vocational psychology: 
19, 21, 33, 36, 50, 73, 101, 2 

Rhodes, E. C., 18, 153, 306 

Richardson, M. W., 295 

Roberts, J. A. F., 157, 235, 304 01 

Rodger, A., 27, 30, 40, 87-8, pr 
94, 102, 118, 143, 206, 298, 
309 
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Rodger, A. G., 183, 309 

Rodger, T. F., 160 

Rogers, C. R., 119, 309 

Rollin, Н. R., 161—2, 309 

Rorschach, H., 209, 253, 255-6 

Rothney, J. W. M., 182-3, 189 z., 304 

Rowntrees Ltd., 19, 89, 208 

Royal Armoured Corps, 44, 243 

Royal Army Medical Corps, 40 

Royal Army Service Corps, 230 

Royal Artillery, 125, 139 

Royal Canadian Air Force, 77,135, 301 

Royal Electrical and Mechanical 
Engineers, 54, 57 

Royal Engineers, 54, 253 

Royal, Marine Corps, 35, 64, 216-7, 


Royal Naval Personnel Research 
Committee, 37 
Russell, J. T., 130, 310 


5 factor, 238 

Sarbin, T. R., 153-5, 309 
Scates, D. E., 134, 306 

Science 4, 09, 72-4, 80, 299-301 


eru Adviser to the Air Ministry, 


Scientific Adviser to the 
бы Council, 41, 51 
Scientific management, 13, 16 
Scott, I. D., 20, 95, 309 
cottish Council for Research in Edu- 
5 cation, 18 
сеї Cadets, 138, 199-201 
ashore, C. E., 173, 209 
econd Sea Lord, 27 
election for secondary education, 18, 
87, 93, 104, 115, 180, 204 
uma education, 180, 207 
ive H А, 
^ 304,200, 209 Чот» 19 022, 17, 
election Grades, 43, 176-7, 179-80, 
Sel 185 197-8, 212, 216-17 
election in Allied Armies, 50 
election of administrators, 62-3, 
209-10 
African and Indian recruits, 50, 240 
air fitters, 35, 121-2, 138, 199, 210, 


Army 


air ee 69, 76, 242, 273-5 
air mechanics, 35, 121-2, 185, 199 
7210, 212, e dé Pe 

aircraft finishers, 280 

aircrafthands, 279 

aircrew, 17, 22, 68—78, 151, 164, 
198-9, 210-11, 225, 229-30, 240- 
2, 248-0, 254, 260-78, 285, 299— 

anti-aircraft personnel, 47, 209, 
211, 217-18, 235, 239, 248 


Selection of—contd. 
anti-submarine (Asdic) personnel, 
17, 22, 25-6, 32, 34, 36, 172-3, 
210-11, 224, 231-2, 247, 298 
apprentices, 35-6, 50, 78-9, 158, 
206-7, 210, 236, 243, 245-6, 297, 
299-300 
armoured car crews, 279 
armourers, 245, 279 
artificers, 25, 36, 193-4, 210-11, 
214, 246, 297 
assembly workers, 205 
balloon operators, 279 
batmen, 279 
bombers, see aircrew 
builders, 44, 192, 245 
business executives, 205, 210 
carpenters, 245, 280 
cinema projectionists, 210, 246 
Civil Servants, 64, 209, 289 
clerks, 44, 70, 120, 139, 172, 179, 
192, 205, 209, 213-14, 232, 259- 
60, 278-80, 285, 296 
иче forces ratings, 186, 199, 217, 
23 
coders, 180, 199 
communications 
199, 213-14 
compositors, 205 
cooks, 28, 139, 198-9, 210-11, 279 
cotton-mill machine fixers, 206 
deaf children, 251 
divers, 48 
draughtswomen, 239 
drivers, 16-17, 19, 44, 120, 138-9, 
158, 192, 209, 214, 224, 230-1, 
247, 259-60, 279, 296 
electrical mechanics, 25 
157-8, 210-12, 214, 2 
electrical workers, 192, 2 
80, 297 
engineer officers, 54, 57 
236, 298 
engineers, 146, 170, 205, 243 
equipment assistants, 279-80 
fabric workers, 279 
farm workers, 192 
fitters, 207, 243, 245, 301 
flight engineers, 242, 273-6 
glider pilots, 299-301 
ground observers, 279 
groundsmen, 279 
gunnery personnel, 25, 34, 44, 175, 
210-11, 213-14, 218, 247-8, 279 
infantry, 139, 213-15, 239, 243 
instructors, 32, 36, 51, 81, 210, 251, 
+ 279 
interviewers, 144, 210, 285 
labourers, 45, 192 
layers, see gunnery personnel 
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ratings, 137-8, 
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å Selection of—contd. 
ig ccena t special operators, 120, 250 


machine operators, 192, 206-7, 243 
н 108) 164, 257, 289 
mates, 192 
mechanicians, 36 
mechanics, 32, 44, 50, 70, 88, 120- 
2, 136-7, 146, 148, 171, 199, 
209, 213-16, 228, 235, 239, 
243-4, 260, 278, 285, 288, 296-7 
engine room, 210-11, 214 
flight, 279—80 
instrument, 214, 245, 259, 279 
motor, 210-11, 214, 245 
ordnance, 210 
see also air, electrical, radio 
metal workers, 192, 206, 208, 259 
meteorologists, 279 
motor boat crew, 279 
motormen, 205; see also drivers 
musicians, 209 
Сом 32, 51, 151, 161, 211, 218, 
23 


navigators, see aircrew 

nurses, 209 

officers, see Officer selection 
orderlies, 44, 279 

P.S.O.s, 41, 210 

packers, 205 

parachutists, 161 
photographers, 210-11, 279-80 
pilots, see aircrew 

police, 63, 152, 169, 279 
precision workers, 192 
brofessional workers, 66, 205, 209, 


psychological warfare staff, 63 
-A.F. ground trades, 68, 70-2, 
78-9, 198-9, 278-80 

radar operators, 25, 32, 34, 36, 158, 
173, 175, 210-11, 230, 235, 239; 
244—5, 249, 279-80, 298 

radio operators, 22, 179-80 

radio-wireless mechanics, 36, 71 
141-2, 157-8, 198-9, 210-11’ 
213-15, 243, 246, 279-80, 297,999 

recruiting assistants, 27, 210 

research workers, 210 

retail tradesmen, 192 

riflemen, 44, 214) 


, 


Safety equipment ratings, 138 
158-9, 210-11 ap à 

salesmen, 153, 210 

school-teachers, 205, 210 

seamen, 32, 138, 140-1, 197-9, 


210-15, 228-9 

sick berth attendants, 211 

signallers and signalmen, 44, 137, 
210-11, 214, 216-17, 935; 239 
244, 249-50, 296-7, 300 


П 


specialists, 17, 34, 50, 62-3, 96 
stenographers, 48 
stewards, 199, 210-12 
stokers, 28, 36, 140-1, 198-9, 210- 
12, 214 
storemen, 44, 214 3 
supply assistants, 210-11, 214, 
232 
switchboard operators, 139 
tank drivers, 22 
telegraphist air gunners, 35-6, 137, 
210-11 
telegraphists, 17, 28, 32, 137-8, 
198-9, 205, 209-11, 214, 250 
telephonists, 17, 175, 279-80 
teleprinter operators, 279 
toolmakers, 205 в 
torpedo ratings, 25-6, 32, 34, 36, 
210-11, 218, 279 
tradesmen, 18, 25, 30, 39, 44-5, 79, 
109, 120-2, 193-4, 242 
turners, 207 PE 
W./T. slip readers, 279- 
wireless бойго, 69-70, 120, 273- 
4, 279-80, 296 
woodworkers, 192 
writers, 28, 32, 199, 210-12, 214, 
232, 275, 296-8 d 
Selection ratio, 86-7, 128-30, 178, 
190, 216-17 240 
Selection work abroad, 48, 50, 53, 2 Б 
Selectivity, 115-17, 122-3, 1221 
137-8, 158-9, 168, 205, 212-18, 
231, 244, 246, 256, 274-5, 278, 
280, 284-5, 295 
Senior Psychologist’s Department, 
27-38, 296-8 
Senior Training Corps, 199-200 nom 
SER Ag Flying Training School, 262 
271 


Shakow, D., 206, 208 

Shipley, W. C., 31, 56, 221, 225-6, 
233, 290, 309 

Shuttleworth, C. W., 206, 309 

Slater, E., 256, 309 2 

Slater, P., 236-7, 256, 258, 260, 302, 
309 

Smith, P., 87, 118, 200, 243, 306 

South African Air Force, 249, 254 

Spearman, C. E., 15, 222, 239 

Special Entry Cadets (R.N.), 34 

Special Service ratings, 25, 35 

Stanbridge, R. H., 19, 309 

Standardisation, see Tests 

Stinines 177 n., 274 nology, 5 

tatistical methods in psychology, 9 

13, 15, 38, 49, 73-4, 100, 104-5, 
108 n., 113, 178, 180, 284 


INDEX 


Statistical significance, 105, 136, 192, 
201-3, 274 

Stenquist, J. L., 206-7, 242 

Stephenson, W., 51, 68, 70-4, 78, 
226, 236-8, 299-300 

Stott, M. B., 106, 117, 309-10 

Straker, D., 29, 107, 310 

Strang, R., 143, 310 

Strong, E. K., 245, 259 

Student counselling, 13, 19, 89, 119 

Stuit, D. B., 158, 259 n., 310 

Summed S.G.s, 43-4, 191, 212 

Sutherland, J. D., 40, 54, 310 

Sutton E.M.S. Hospital, 258 

Swineford, F., 207, 306 

Symonds, P. M., 134, 143, 310 


Tavistock Clinic, 12 n., 64, 93 
Taylor, F. W., 16 
"Taylor, H. C., 130, 310 
Temperament, see Personality 
Tennessee Valley Authority, 207 
Terman, L. M., 177, 186, 240 
Tests, acceptability of, 95, 97, 163-4, 
170, 189, 221, 255, 287 
analogous, 171-3 « 
analytic, 171-3, 209, 230-1 
apparatus, 169—70, 174, 249 
aptitude, 13, 170, 260 
compared with clinical judgments, 
153-5, 157-60, 162-4, 287 
construction of, 40, 49, 73, 100, 
163-4, 172-6, 286, 288 
educational attainments, 13, 15, 18, 
20, 57, 74, 166, 168-9, 176, 180, 
183, 189, 193-5, 199-200, 205, 
213-10, 934, 238-9, 243, 245, 
251, 273, 275, 288-9 
expendable, 170 
face value of, 167, 241, 255, 283, 


group v. individual, 169 
intelligence, see Intelligence tests 
Interests, see Interests, tests of 
inventive v. selective items, 170, 
.175, 187, 221-2, 229, 237, 242, 
‚ 281, 288 
limitations of, 94-6, 138—9, 163—4, 
166—7, 218-19, 281, 287 
main characteristics of, 165-72 
mechanical scoring of, 18, 169-70, 
174 
miniature situation, 172, 175, 209, 
248, 288 У 
norms, see standardisation of 
origin and development of, 13-22 
Paper-and-pencil v. practical, 47, 
169-70, 207, 243-4, 248-9, 287-8 
pass-marks on, see standards on 
personality, see Personality tests 


Tests—contd. 

practicability of, 173-4, 208, 248, 
255, 259, 288 

relevance of in interviewing, 144, 
147-9, 162 

reliability, see Reliability, tests 

screening, 157, 256; see also 
„Neurotic tendencies 

standardisation of, 15, 46, 61, 65, 
71, 166, 176-80, 187, 190, 195 

standards on, 29-30, 44, 49, 71, 
129-30, 177-80, 266, 278, 287 

trade, see Trade tests 

validity, see Validation of tests 

weighting of, 46 n., 96, 104-5, 112, 
154, 174, 274 

work-sample, 62, 75, 171-2, 209, 
224, 248 

Tests, titles; see also Selection of ad- 

ministrators, etc. 

Al, 299 

AT2, 299 

AH4, 184, 220, 231, 298 

AH, 205 

‘Abstraction, 30-1, 56, 180, 185-6, 
193-5, 209, 212, 214, 217-18, 
221-3, 226, 232-3, 240, 244, 
246, 288, 296-8 

Agility, 43, 47, 172, 185, 190, 201, 
214, 247-8, 256, 297 

Aircrew Aptitude Test Battery, 74, 
76-8, 273-6 Y 

arithmetic, see Tests, mathematics 

Army Alpha, 17, 195, 205, 214, 
224 

Army Beta, 17, 238 

auditory, 43, 166, 173, 231-2, 250- 
1, 297 

Bartlett, 69, 74 

Bennett, see Tests, mechanical 
comprehension 

Bennett-Slater questionnaire, 258, 
260 

Binet, 13, 15, 17, 18, 177, 186 

Body Sway, 253 

Carl Hollow Square, 56 

Cattell Scale III, 195, 205 

clerical, 43, 79, 168, 172, 178-9, 
185-6, 190, 202, 212, 214-5, 217- 
8, 222-4, 232-3, 250, 255, 288, 
296, 298 

Code Aptitude, see Tests, Morse 
Aptitude 

colour-blindness, 29 

completion, 15 i 

Control of Velocity, 248-9, 273, 301 

Co-ord.-A, B, C, D, 248-9, 273, 
300-1 

co-ordination, see manual dexterity 

Cornell Selectee Index, 256, 258 
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ests, titles—contd. - 
Cox's Mechanical, 207; see also 
Tests, mechanical models 
Craik Predictor, 248 
Cube Construction, 32, 166, 205-6, 
239 
Cursive Miniature Situation, 255 
dark adaptation, see night vision 
deterioration, 20, 221, 235 
dictation, see Tests, spelling 
Directions, 70, 224, 230-1, 245, 288 
Dominoes, 236, 296 
dotting machine, 254-5 
E.M.T., 69 
Echo, 249 
electrical information, 141, 158, 
170, 193-4, 249-3, 240, 239, 297 
English, 30, 225, 298 
F.H.3, F.H.R., 40, 161, 220, 208 
Figure Construction, 207, 233, 237— 
8, 240, 297 
Fitter I Maths., 301 
flap, 249, 254—5 
Formboards, 206, 208 
Form Relations, 206, 237 
G.LT., 69 
G.V.K., 70-1, 186, 197, 226, 236, 
238, 273, 278-80, 299 
G4, 236, 273, 299 
G5, 299 
Gen. A, 273 
Gen. B, 225, 273, 299 
Gen. C, 273, 299 
Gen. D, 242, 273, 299 
General Classification, 205, 215 
general knowledge, 69, 209 
geography, 57 
graph reading, 158, 230, 244-5, 
249, 298 
Group Test 25, 222, 232-3, 298 


Group Test 33, 160, 205-6, 220, 
233, 298 


Group Test 70/1, 236, 244-5, 298 
Group Test 70/28, 233, 236, 344-5, 


Group Test 80, 207, 236, 298 

hearing, see Tests, auditory 

history, 57 

Ins-A, B, 230, 273, 300 

Instructions, 43, 46 N., 222-4, 939_ 
3, 230, 243, 245, 251, 255-6, 296 

K6, 238, 273, 299 

Kent-Shakow, see "Tests, 
boards 

Kohs Block Design, 32-3, 56, 194, 
239-40, 246, 296 

Kuder Preference Record, 259 

layers, 248 

leaderless groups, 34, 53, 61-2, 
64-5, 126 


Form- 


- INDEX 


Tests, titles—contd. 


‹ , 


Link Trainer, see Tests, Visual 
Link 
literacy, 131, 225 +. 
Livingstone Hexagon, see night 
vision 
MacQuarrie mechanical, 208, 291 
manual dexterity, 167-8, 205, 207— 
9, 248, 258, 256, 273, 301 _ 
Mat-A, B, C, D, E, F, 229, 273, 
299-300 
mathematics, 30-1, 43, 46 n., 48, 
54, 57, 69-70, 141, 168, 179, 
184—6, 189-91, 193—5, 199-201, 
205-7, 212-8, 328-9, 232-4, 243— 
6, 249-51, 273, 278, 288, 296-301 
Matrices, see Tests, Progressive 
Matrices 
Mec (A.T.S.), 203, 242, 251, 297 
Mec-A, B, C, 242, 273, 300 б 
mechanical, 19, 35, 50, 88, 105; 
155, 168-70, 174, 180, 184-6, 
213-16, 234-5, 238-9, 208 
assembly, 43, 57, 169, 185, ДӨ, 
201, 203, 206-7, 214, 242-3, 
245, 288, 296 
comprehension, 22, 30-1, 40, ay 
46 n., 176-7, 179, 185, 190-1, 
193-4, 201, 207, 212-14, 217- 
18, 230-1, 241-6, 273, 288, 
296, 298, 300 
information, 30, 57, 168, 110-1, 
193-4, 207, 215, 242-3, 246, 
278, 287-8, 297, 299-300 | 
models, 193-4, 206-7, 243, 246, 
288, 297, 299 6-7 
Memory for Designs, 193, 206-7, 
233, 237, 244, 240, 297 
Messages, 43, 190 Т 
Mill Hill Vocabulary, 56, 225 E 
Minnesota, see Tests, Formboar 
and Paper Formboard 
Moorrees, see ‘Testa, Formbuaide 
oray House, 18, 1 
Morse Aptitude, 43, 70, 169, ae 
176-7, 185, 190, 249-50, 27%, 
296, 300 
Morse Receiving, 137, 250 n. ree 
Murray, see Tests, Thematic AP 
. Perception 
musical talent, 173, 209 
Na D КАШГ 258 
ау! 301 
night ale 17, 175, 209, 253, 256 
nominations, 257, 290 я 
numerical, see Tests, mathematics 
O.LR., 56, 160 
O.R.1 Index, 46, 201 
Oakley, see Tests, Formboards 
Obs-A, B, C, 240-1, 273, 300 
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Tests, titles—contd. 

O'Connor Tweezer Dexterity, 253 

cannes Wiggly Blocks, 206, 

orientation, 233, 240, 297 

oscilloscope reading, 244-5, 298 

Paper Formboard, 207, 237-8 

perceptual, 47, 209 

performance, 17, 18, 22, 169, 194, 
206, 238-40, 288 

perseveration, 253 

Persistence, 253 

personal Tempo, 253 

physical, 168; see also ‘Tests, 
Agility 

physics, 57 

Pointers, 249, 254 

Practical Problems, 241 

Pressey Cross-out, 258 

Progressive Matrices, 29, 40, 43-4, 
46 n., 47, 56, 160-1, 168 л., 170— 
7, 184, 186, 190-3, 195-8, 201-3, 
212, 314-15, 217, 223, 230-1, 
232-6, 238, 244-5, 248, 296 

projection, 20, 22, 34, 53, 56-8, 
160, 209, 255, 260, 290 

Purdue mechanical assembly, 206 

RA, RB, RE, RF, 30, 298 

RC, 30, 227, 298 

RD, 30, 221, 298 

Radio Symbols, 246, 297 

Raven, see Tests, Mill Hill; Tests, 
Progressive Matrices 

reaction time, 22, 64, 175, 209, 
230-2 

reading, 171, 215, 224-5, 232-3, 

Rorschach Inkblots, 209, 253, 
255-6 


S.M.A.3, 70, 248, 273, 300 

S.P., 296-8 

S.P. 127-30 (Dictation), 226 

scale reading, 158, 230, 244—5, 249, 
273, 298, 300 

Seashore, see musical talent 

Selection Test A, 231-2, 298 

sensory-motor, 47, 70, 74, 171, 
930—4, 254, 256 

Shipley, see Tests, Abstraction; 
‘Tests, vocabulary * 

Short Format, see Tests, N.D.R.C. 
Inventory 

shorthand, 48, 137, 172 

sorting, 56, 109 

spatial judgment, 167—9, 174, 183- 
4, 187, 206-7, 213-15, 234-40, 
243, 288, 298-9; see also Tests, 
Squares 

Speed of Response, 
Morse Aptitude 


see Tests, 
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ests, titles—contd. 

spelling, 30-1, 48, 180, 186-7, 
190-1, 194—5, 214, 217-18, 224, 
226-7, 229, 232, 250-1, 256, 288, 
297-8 

Squares, 31, 43, 158, 185—6, 190-1, 
193-4, 207, 212-14, 217-18, 230, 
236-7, 239-40, 244—6, 296 

standard fight, 75, 78, 81, 207-9, 

01 
Stanford-Binet, see Tests, Binet 


Strong Vocational Interest Blank, 
245, 259-60 
тэ, 31, 35, 116, 140-2, 158-9, 180, 


185-6, 191, 193-4, 197-9, 212-14, 
216-18, 231-3, 244, 246, 248-9 
tapping, 70 
Terman-Merrill, 177, 186, 240 
Thematic Apperception, 22, 57, 65, 
166, 255 
Track Tracer, 253-4 
‘Trist-Misselbrook, see Tests, Kohs 
Block Design 
typewriting, 48, 137 
V4, 226, 273, 299 
‚ V.LT., 56, 220-1, 233, 297 
verbal ability, 43, 167-8, 185, 190— 
1, 201, 214, 224-7, 245, 273, 288, 
297 * 
Vincent, see Tests, 
models 
vision, 99, 166, 174-5, 209, 256 
Visual Link, 77-8, 301 
vocabulary, 170, 189, 221, 224-6, 
233, 296; see also Tests, Mill 
. Hill; Tests, Wechsler-Bellevue 
Wechsler-Bellevue, 33, 56, 225, 296 
Weigl, see Tests, sorting 
Wiggly Blocks, see Tests, O'Connor 
Wiggly Blocks 
Wirebending, 242-3, 246, 297 
Word Association, 57, 253, 255-6 
Thomson, G. H., 15 
‘Thorndike, E. L., 15, 68 
‘Thorne, General Sir A., 53 
"Thurstone, L. L., 15, 112, 207, 209, 
938, 310, 
Tiffin, J., 86, 89, 130, 206, 310 


mechanical 


Time and motion study, 38, 51, 91, 


283 

Toronto University, 69 

Trade tests, 18, 107, 113-14, 121, 
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